Results for "tokens/sec"
Masked language modeling: predicts masked tokens in a sequence, enabling bidirectional context; often used for embeddings rather than generation.
Leakage detection: detecting unauthorized model outputs or data leaks.
Language model: a model that assigns probabilities to sequences of tokens; often trained by next-token prediction.
Top-k sampling: samples from the k highest-probability tokens to limit unlikely outputs.
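The top-k strategy can be sketched in a few lines of pure Python; the function name and signature here are illustrative, not from any particular library:

```python
import math
import random

def top_k_sample(logits, k, rng=random):
    """Sample a token index from the k highest-probability entries of `logits`."""
    # Keep only the indices of the k largest logits.
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    # Softmax over just that set (subtract max for numerical stability).
    m = max(logits)
    weights = [math.exp(logits[i] - m) for i in top]
    total = sum(weights)
    probs = [w / total for w in weights]
    return rng.choices(top, weights=probs)[0]
```

With k=1 this degenerates to greedy (argmax) decoding; larger k admits more diversity at the risk of lower-probability continuations.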
Top-p (nucleus) sampling: samples from the smallest set of tokens whose probabilities sum to p, adapting the set size to context.
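A minimal nucleus-sampling sketch, again in plain Python with an illustrative function name:

```python
import math
import random

def top_p_sample(logits, p, rng=random):
    """Sample from the smallest set of tokens whose probability mass reaches p."""
    # Full softmax over the vocabulary.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    z = sum(exps)
    probs = [e / z for e in exps]
    # Accumulate tokens in descending probability order until mass >= p.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, mass = [], 0.0
    for i in order:
        kept.append(i)
        mass += probs[i]
        if mass >= p:
            break
    weights = [probs[i] for i in kept]
    return rng.choices(kept, weights=weights)[0]
```

Unlike top-k, the candidate set shrinks when the distribution is peaked and grows when it is flat, which is the adaptivity the definition refers to.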
Positional encoding: injects sequence order into Transformers, since attention alone is permutation-invariant.
Next-token prediction: training objective where the model predicts the next token given the previous tokens (causal language modeling).
Autoregressive generation: generates sequences one token at a time, conditioning on past tokens.
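The decoding loop itself is simple; the model below is a hypothetical stand-in (any callable mapping a prefix to the next token would do):

```python
def generate(next_token, prompt, max_new_tokens):
    """Autoregressive decoding: repeatedly append the model's next token."""
    seq = list(prompt)
    for _ in range(max_new_tokens):
        tok = next_token(seq)   # the model conditions on everything so far
        seq.append(tok)
    return seq

# Toy "model" for illustration: emit the last token plus one.
demo = generate(lambda seq: seq[-1] + 1, [0], 3)
```

In a real system `next_token` would run a forward pass and a sampling step (greedy, top-k, top-p, etc.) over the model's logits.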
Context window: the maximum number of tokens the model can attend to in one forward pass; constrains long-document reasoning.
Throughput: how many requests or tokens can be processed per unit time (e.g., tokens/sec); affects scalability and cost.
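The tokens/sec metric in the query is just generated tokens divided by wall-clock time; a sketch with illustrative names:

```python
def throughput_tokens_per_sec(num_tokens, elapsed_sec):
    """tokens/sec: tokens processed divided by wall-clock time."""
    return num_tokens / elapsed_sec

# A batch of 4 sequences, 300 tokens each, decoded in 2 seconds:
rate = throughput_tokens_per_sec(4 * 300, 2.0)
```

Batched serving raises aggregate tokens/sec even when per-request latency stays flat, which is why the definition ties throughput to both scalability and cost.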
Causal mask: prevents attention to future tokens during training and inference.
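The mask is just a lower-triangular pattern over (query, key) positions; a minimal sketch:

```python
def causal_mask(n):
    """n x n mask: True where query position i may attend to key position j.

    Position i may only see positions j <= i, never the future.
    """
    return [[j <= i for j in range(n)] for i in range(n)]
```

In practice the False entries are implemented by adding a large negative value to the attention scores before the softmax, so masked positions get near-zero weight.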
Rotary position embeddings (RoPE): encode positional information via rotation in embedding space.
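A minimal sketch of the rotation, assuming an even-dimensional vector and the conventional base of 10000; consecutive pairs of coordinates are rotated by a position-dependent angle:

```python
import math

def rope(vec, pos, base=10000.0):
    """Rotate consecutive coordinate pairs of `vec` by angles that grow with `pos`."""
    d = len(vec)
    out = list(vec)
    for i in range(0, d, 2):
        theta = pos * base ** (-i / d)   # frequency falls with dimension index
        c, s = math.cos(theta), math.sin(theta)
        x, y = vec[i], vec[i + 1]
        out[i], out[i + 1] = x * c - y * s, x * s + y * c
    return out
```

Because each pair is rotated rather than translated, vector norms are preserved, and the dot product between a rotated query and key depends on the *relative* offset between their positions.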
Absolute (sinusoidal) positional encoding: encodes token position explicitly, often via sinusoids.
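The classic sinusoidal scheme interleaves sines and cosines at geometrically spaced frequencies; a sketch for a single position (assuming the usual 10000 base):

```python
import math

def sinusoidal_pe(pos, d_model):
    """Sinusoidal positional encoding vector for one token position."""
    pe = []
    for i in range(0, d_model, 2):
        freq = 10000.0 ** (-i / d_model)   # lower frequency at higher dims
        pe.append(math.sin(pos * freq))
        pe.append(math.cos(pos * freq))
    return pe[:d_model]
```

The multi-scale frequencies let the model distinguish both nearby and distant positions, and the encoding needs no learned parameters.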
Efficient attention: attention variants that reduce the quadratic complexity of full self-attention.
Attention: mechanism that computes context-aware mixtures of representations; scales well and captures long-range dependencies.
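Scaled dot-product attention, the core computation, fits in a short pure-Python sketch (lists of lists stand in for matrices; a real implementation would use tensor operations):

```python
import math

def attention(Q, K, V):
    """Scaled dot-product attention: each query mixes values by key similarity."""
    d = len(K[0])
    out = []
    for q in Q:
        # Similarity of this query to every key, scaled by sqrt(d).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in K]
        # Softmax over key positions.
        m = max(scores)
        w = [math.exp(s - m) for s in scores]
        z = sum(w)
        w = [x / z for x in w]
        # Weighted mixture of value vectors.
        out.append([sum(wi * val[j] for wi, val in zip(w, V))
                    for j in range(len(V[0]))])
    return out
```

When all keys look alike the weights are uniform and the output is the mean of the values; sharper similarities concentrate the mixture on a few positions.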
Tokenization: converting text into discrete units (tokens) for modeling; subword tokenizers balance vocabulary size and coverage.
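A greedy longest-match tokenizer over a fixed subword vocabulary illustrates the idea (real tokenizers such as BPE learn their vocabularies from data; the vocabulary below is made up for the example):

```python
def greedy_tokenize(text, vocab):
    """Greedy longest-match subword tokenization over a fixed vocabulary."""
    tokens, i = [], 0
    while i < len(text):
        for j in range(len(text), i, -1):   # try the longest piece first
            piece = text[i:j]
            if piece in vocab:
                tokens.append(piece)
                i = j
                break
        else:
            tokens.append(text[i])          # unknown text falls back to one char
            i += 1
    return tokens
```

The single-character fallback is what gives subword schemes full coverage: any string can be tokenized even when no long piece matches.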
Vocabulary: the set of tokens a model can represent; impacts efficiency, multilinguality, and handling of rare strings.
Sampling: stochastic generation strategies that trade determinism for diversity; key knobs include temperature and nucleus (top-p) sampling.
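The temperature knob divides the logits before the softmax; a sketch with an illustrative function name:

```python
import math
import random

def sample_with_temperature(logits, temperature, rng=random):
    """Divide logits by temperature before softmax: T < 1 sharpens, T > 1 flattens."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    w = [math.exp(x - m) for x in scaled]
    z = sum(w)
    return rng.choices(range(len(logits)), weights=[x / z for x in w])[0]
```

As T approaches 0 the distribution collapses onto the argmax (near-greedy decoding); large T pushes it toward uniform, maximizing diversity at the cost of coherence.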
Long-context methods: techniques to handle longer documents without quadratic cost.
Rate limiting: restricting inference usage per client or per unit time to protect capacity and control cost.
Transformer: architecture based on self-attention and feedforward layers; the foundation of modern LLMs and many multimodal models.
Beam search: a search algorithm for generation that keeps the top-k partial sequences; can improve likelihood but reduce diversity.
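A toy beam search over a stand-in model (the `step_logprobs` callable and the two-token distribution below are hypothetical, for illustration only):

```python
import math

def beam_search(step_logprobs, start, width, length):
    """Keep the `width` best partial sequences at each step; return the best."""
    beams = [([start], 0.0)]                 # (token sequence, total log-prob)
    for _ in range(length):
        candidates = []
        for seq, score in beams:
            # Extend every beam with every possible next token.
            for tok, lp in step_logprobs(seq).items():
                candidates.append((seq + [tok], score + lp))
        # Prune back down to the `width` highest-scoring candidates.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = candidates[:width]
    return beams[0][0]

# Hypothetical toy model: from any prefix, token 1 is likelier than token 0.
best = beam_search(lambda seq: {0: math.log(0.4), 1: math.log(0.6)}, 2, 2, 3)
```

Because every beam tracks cumulative log-probability, the search can recover sequences a greedy decoder would miss, but the pruning step is also why outputs tend to cluster around a few high-likelihood continuations.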
KV cache: stores past attention keys and values to speed up autoregressive decoding.
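A minimal sketch of the idea, assuming single-head attention over plain Python lists: each decoding step appends one key/value pair and attends over everything cached so far, instead of recomputing all past keys and values.

```python
import math

class KVCache:
    """Append-only store of past keys/values for incremental decoding."""

    def __init__(self):
        self.keys, self.values = [], []

    def attend(self, q, k, v):
        """Cache (k, v) for the new token, then attend q over all cached pairs."""
        self.keys.append(k)
        self.values.append(v)
        d = len(q)
        scores = [sum(qi * ki for qi, ki in zip(q, key)) / math.sqrt(d)
                  for key in self.keys]
        m = max(scores)
        w = [math.exp(s - m) for s in scores]
        z = sum(w)
        w = [x / z for x in w]
        return [sum(wi * val[j] for wi, val in zip(w, self.values))
                for j in range(len(v))]
```

This turns each decoding step from O(n) recomputation of keys/values into a single projection plus a lookup, at the cost of memory that grows linearly with sequence length.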
Vision Transformer (ViT): a Transformer applied to image patches.
Inference cost: the cost to run models in production.