Results for "autoregressive"
Autoregressive Model
IntermediateGenerates sequences one token at a time, conditioning on past tokens.
An autoregressive model is like a storyteller that creates a story one word at a time. It starts with the first word and then looks at all the words it has already written to decide what comes next. For example, if it starts with 'Once upon a time', it might add 'there was a princess' as the next...
Generates sequences one token at a time, conditioning on past tokens.
Prevents attention to future tokens during training/inference.
A model that assigns probabilities to sequences of tokens; often trained by next-token prediction.
Stores past attention states to speed up autoregressive decoding.
Training objective where the model predicts the next token given previous tokens (causal modeling).
Predicts masked tokens in a sequence, enabling bidirectional context; often used for embeddings rather than generation.
Sequential data indexed by time.
Classical statistical time-series model.