Results for "sequence modeling"
Attention where queries/keys/values come from the same sequence, enabling token-to-token interactions.
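A minimal sketch of scaled dot-product self-attention in NumPy, where queries, keys, and values are all projections of the same sequence; weight names and shapes are illustrative, not a specific library's API:

```python
import numpy as np

def self_attention(x, Wq, Wk, Wv):
    """Scaled dot-product self-attention over one sequence.

    x: (T, d) token embeddings; queries, keys, values all come from x.
    """
    q, k, v = x @ Wq, x @ Wk, x @ Wv
    scores = q @ k.T / np.sqrt(k.shape[-1])        # (T, T) token-to-token scores
    weights = np.exp(scores - scores.max(-1, keepdims=True))
    weights /= weights.sum(-1, keepdims=True)      # softmax over keys
    return weights @ v                             # each output mixes all values

rng = np.random.default_rng(0)
T, d = 4, 8
x = rng.normal(size=(T, d))
W = [rng.normal(size=(d, d)) for _ in range(3)]
out = self_attention(x, *W)
assert out.shape == (4, 8)
```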
Prevents attention to future tokens during training/inference.
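One common way to implement this is an additive mask of `-inf` above the diagonal, applied to the attention scores before the softmax; a small sketch:

```python
import numpy as np

def causal_mask(T):
    # Entry (i, j) is 0 where token i may attend to token j (j <= i),
    # and -inf where j lies in the future, so softmax assigns it zero weight.
    return np.where(np.tril(np.ones((T, T))) == 1, 0.0, -np.inf)

m = causal_mask(3)
assert m[0, 1] == -np.inf and m[2, 0] == 0.0
```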
Injects sequence order into Transformers, since attention alone is permutation-invariant.
Generates sequences one token at a time, conditioning on past tokens.
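The generation loop can be sketched as follows; `next_token_logits` here is a hypothetical stand-in for a trained model, and this version decodes greedily:

```python
def generate(next_token_logits, prompt, max_new_tokens, eos=None):
    """Generate greedily, one token at a time, conditioning on the growing prefix."""
    seq = list(prompt)
    for _ in range(max_new_tokens):
        logits = next_token_logits(seq)          # condition on all past tokens
        tok = max(range(len(logits)), key=logits.__getitem__)
        seq.append(tok)
        if tok == eos:
            break
    return seq

# Toy "model" over vocab {0, 1, 2}: always prefers token (last + 1) mod 3.
toy = lambda seq: [1.0 if t == (seq[-1] + 1) % 3 else 0.0 for t in range(3)]
assert generate(toy, [0], 4) == [0, 1, 2, 0, 1]
```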
Mismatch between training and inference conditions, e.g. exposure bias: a model trained on ground-truth prefixes must later condition on its own, possibly erroneous, outputs.
Modeling how an environment evolves in a learned latent space, so future states can be predicted or planned over without generating raw observations.
Networks with recurrent connections for sequences; largely supplanted by Transformers for many tasks.
Training objective where the model predicts the next token given previous tokens (causal modeling).
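In practice this objective just shifts the token stream by one position, so each input position is trained to predict its successor:

```python
# Each input token's target is the token that follows it in the sequence.
tokens = [5, 3, 9, 2]
inputs, targets = tokens[:-1], tokens[1:]
assert inputs == [5, 3, 9] and targets == [3, 9, 2]
```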
An RNN variant using gates to mitigate vanishing gradients and capture longer context.
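A compact NumPy sketch of a single LSTM step, with the standard input/forget/cell/output gating; the stacked-weight layout is one common convention, not the only one:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, h, c, W, b):
    """One LSTM step: gates decide what to forget, what to write, what to expose.

    W: (d_in + d_h, 4*d_h) stacked weights for input/forget/cell/output gates.
    """
    z = np.concatenate([x, h]) @ W + b
    d = h.shape[0]
    i, f, g, o = z[:d], z[d:2*d], z[2*d:3*d], z[3*d:]
    c_new = sigmoid(f) * c + sigmoid(i) * np.tanh(g)   # gated memory update
    h_new = sigmoid(o) * np.tanh(c_new)                # gated output
    return h_new, c_new

rng = np.random.default_rng(0)
d_in, d_h = 3, 5
W = rng.normal(size=(d_in + d_h, 4 * d_h)) * 0.1
b = np.zeros(4 * d_h)
h, c = np.zeros(d_h), np.zeros(d_h)
h, c = lstm_step(rng.normal(size=d_in), h, c, W, b)
assert h.shape == (5,) and c.shape == (5,)
```

The additive update to the cell state `c` is what lets gradients flow over long spans, mitigating the vanishing-gradient problem of plain RNNs.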
Convolutional networks applied along the time axis of a sequence; dilated convolutions can widen the receptive field to capture longer-range dependencies.
Predicts masked tokens in a sequence, enabling bidirectional context; often used for embeddings rather than generation.
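A simplified BERT-style masking routine; the mask rate, mask id, and the `-100` ignore value are illustrative conventions rather than requirements:

```python
import numpy as np

def mask_tokens(tokens, mask_id, rate=0.15, rng=None):
    """Replace a random subset of tokens with a mask token; the loss targets
    keep the original token only at masked positions."""
    rng = rng or np.random.default_rng()
    tokens = np.array(tokens)
    masked = rng.random(len(tokens)) < rate
    inputs = np.where(masked, mask_id, tokens)
    targets = np.where(masked, tokens, -100)   # -100: position ignored by the loss
    return inputs, targets, masked

rng = np.random.default_rng(1)
inp, tgt, m = mask_tokens([7, 8, 9, 10], mask_id=0, rate=0.5, rng=rng)
assert all(inp[m] == 0) and all(tgt[~m] == -100)
```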
Search algorithm for generation that keeps top-k partial sequences; can improve likelihood but reduce diversity.
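A short beam-search sketch; `step_logprobs` is a hypothetical per-step scoring function standing in for a model, and length normalization and end-of-sequence handling are omitted for brevity:

```python
import math

def beam_search(step_logprobs, beam_size, length, start=0):
    """Keep the top-k partial sequences by total log-probability at each step."""
    beams = [([start], 0.0)]
    for _ in range(length):
        candidates = []
        for seq, score in beams:
            for tok, lp in step_logprobs(seq).items():
                candidates.append((seq + [tok], score + lp))
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = candidates[:beam_size]            # prune to the k best
    return beams

# Toy model over vocab {0, 1}: token 0 is always a bit more likely.
toy = lambda seq: {0: math.log(0.6), 1: math.log(0.4)}
best = beam_search(toy, beam_size=2, length=3)
assert best[0][0] == [0, 0, 0, 0]   # the all-0 path has the highest total score
```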
Exponential of average negative log-likelihood; lower means better predictive fit, not necessarily better utility.
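The definition maps directly to code; for intuition, a model that spreads probability uniformly over 4 options has perplexity exactly 4:

```python
import math

def perplexity(token_probs):
    """exp of the average negative log-likelihood of the observed tokens."""
    nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(nll)

# Uniform 1/4 probability on every observed token gives perplexity 4.
assert abs(perplexity([0.25, 0.25, 0.25]) - 4.0) < 1e-9
```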
Encodes token position explicitly, often via sinusoids.
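The sinusoidal scheme from the original Transformer can be sketched as follows, with sine on even dimensions and cosine on odd ones:

```python
import numpy as np

def sinusoidal_positions(T, d):
    """Fixed sinusoidal position encodings, one d-dim vector per position."""
    pos = np.arange(T)[:, None]             # (T, 1) positions
    i = np.arange(d // 2)[None, :]          # (1, d/2) frequency indices
    angles = pos / (10000 ** (2 * i / d))   # geometric range of wavelengths
    pe = np.zeros((T, d))
    pe[:, 0::2] = np.sin(angles)            # even dims: sine
    pe[:, 1::2] = np.cos(angles)            # odd dims: cosine
    return pe

pe = sinusoidal_positions(10, 16)
assert pe.shape == (10, 16)
assert abs(pe[0, 0]) < 1e-9 and abs(pe[0, 1] - 1.0) < 1e-9  # sin(0)=0, cos(0)=1
```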
A single attention mechanism within multi-head attention.
Transformer applied to image patches.
Controlling robots through natural-language instructions, typically by grounding language in perception and action.
A model that assigns probabilities to sequences of tokens; often trained by next-token prediction.
Stepwise reasoning patterns that can improve multi-step tasks; often handled implicitly or summarized for safety/privacy.
Deep learning system for protein structure prediction.
Formal framework for sequential decision-making under uncertainty.
Penalizes confident wrong predictions heavily; standard for classification and language modeling.
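A two-class example makes the asymmetry concrete: a confident correct prediction incurs almost no loss, while the same confidence on the wrong class is penalized hundreds of times harder:

```python
import math

def cross_entropy(probs, target):
    """Negative log-probability of the target class; blows up as p -> 0."""
    return -math.log(probs[target])

confident_right = cross_entropy([0.01, 0.99], target=1)   # ~0.01
confident_wrong = cross_entropy([0.99, 0.01], target=1)   # ~4.6
assert confident_wrong > 100 * confident_right
```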
Converting text into discrete units (tokens) for modeling; subword tokenizers balance vocabulary size and coverage.
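A toy greedy longest-match subword tokenizer; the vocabulary below is hand-picked for illustration, whereas real subword tokenizers learn it from data (e.g. via BPE):

```python
def tokenize(text, vocab):
    """Greedy longest-match subword tokenization over a fixed vocabulary."""
    tokens, i = [], 0
    while i < len(text):
        for j in range(len(text), i, -1):   # try the longest substring first
            if text[i:j] in vocab:
                tokens.append(text[i:j])
                i = j
                break
        else:
            tokens.append(text[i])          # unknown character: emit it alone
            i += 1
    return tokens

vocab = {"un", "token", "iz", "able", "t"}
assert tokenize("untokenizable", vocab) == ["un", "token", "iz", "able"]
```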
Models temporal evolution through hidden states that are updated at each step and carry information forward.
Learning from interaction with an environment: taking actions, observing states, and receiving rewards.
The gap between simulated and real-world physics, which can degrade policies transferred from simulation.
Learning from data by constructing “pseudo-labels” (e.g., next-token prediction, masked modeling) without manual annotation.
Maximum number of tokens the model can attend to in one forward pass; constrains long-document reasoning.
A high-capacity language model trained on massive corpora, exhibiting broad generalization and emergent behaviors.
Stochastic generation strategies that trade determinism for diversity; key knobs include temperature and nucleus sampling.
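Both knobs can be sketched in one small function: temperature rescales the logits, then nucleus (top-p) sampling keeps the smallest set of tokens whose probability mass covers `top_p`; the function name and interface are illustrative:

```python
import numpy as np

def sample(logits, temperature=1.0, top_p=1.0, rng=None):
    """Temperature scaling followed by nucleus (top-p) sampling."""
    rng = rng or np.random.default_rng()
    logits = np.asarray(logits, dtype=float) / temperature   # sharpen or flatten
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    order = np.argsort(probs)[::-1]                          # most likely first
    keep = np.cumsum(probs[order]) - probs[order] < top_p    # smallest covering set
    allowed = order[keep]
    p = probs[allowed] / probs[allowed].sum()                # renormalize nucleus
    return int(rng.choice(allowed, p=p))

rng = np.random.default_rng(0)
logits = [2.0, 1.0, 0.1]
# With a small enough top_p only the argmax survives, so sampling is deterministic.
assert all(sample(logits, top_p=0.1, rng=rng) == 0 for _ in range(20))
```

Lower temperature and lower top_p both push toward deterministic, high-likelihood output; raising them trades that determinism for diversity.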