Search: intra-sequence relations

Knowledge Graph Intermediate

Structured graph encoding facts as entityâ€“relationâ€“entity triples.

Model Architectures

Self-Attention Intermediate

Attention where queries/keys/values come from the same sequence, enabling token-to-token interactions.

Transformers & LLMs

Restricted Boltzmann Machine Intermediate

Simplified Boltzmann Machine with bipartite structure.

Model Architectures

Causal Mask Intermediate

Prevents attention to future tokens during training/inference.

AI Economics & Strategy

Positional Encoding Intermediate

Injects sequence order into Transformers, since attention alone is permutation-invariant.

Foundations & Theory

Autoregressive Model Intermediate

Generates sequences one token at a time, conditioning on past tokens.

Foundations & Theory

Masked Language Model Intermediate

Predicts masked tokens in a sequence, enabling bidirectional context; often used for embeddings rather than generation.

Foundations & Theory

Beam Search Intermediate

Search algorithm for generation that keeps top-k partial sequences; can improve likelihood but reduce diversity.

Foundations & Theory

Perplexity Intermediate

Exponential of average negative log-likelihood; lower means better predictive fit, not necessarily better utility.

Evaluation & Benchmarking

Absolute Positional Encoding Intermediate

Encodes token position explicitly, often via sinusoids.

AI Economics & Strategy

Attention Head Intermediate

A single attention mechanism within multi-head attention.

AI Economics & Strategy

Vision Transformer Intermediate

Transformer applied to image patches.

Computer Vision

Exposure Bias Intermediate

Differences between training and inference conditions.

Model Failure Modes

Natural Language Instruction Frontier

Controlling robots via language.

World Models & Cognition

Recurrent Neural Network Intermediate

Networks with recurrent connections for sequences; largely supplanted by Transformers for many tasks.

Neural Networks

Next-Token Prediction Intermediate

Training objective where the model predicts the next token given previous tokens (causal modeling).

Foundations & Theory

LSTM Intermediate

An RNN variant using gates to mitigate vanishing gradients and capture longer context.

Foundations & Theory

Context Window Intermediate

Maximum number of tokens the model can attend to in one forward pass; constrains long-document reasoning.

Transformers & LLMs

Language Model Intermediate

A model that assigns probabilities to sequences of tokens; often trained by next-token prediction.

Large Language Models

Chain-of-Thought Intermediate

Stepwise reasoning patterns that can improve multi-step tasks; often handled implicitly or summarized for safety/privacy.

Foundations & Theory

Large Language Model Intermediate

A high-capacity language model trained on massive corpora, exhibiting broad generalization and emergent behaviors.

Large Language Models

Sampling Intermediate

Stochastic generation strategies that trade determinism for diversity; key knobs include temperature and nucleus sampling.

Foundations & Theory

Top-k Intermediate

Samples from the k highest-probability tokens to limit unlikely outputs.

Foundations & Theory

Top-p Intermediate

Samples from the smallest set of tokens whose probabilities sum to p, adapting set size by context.

Foundations & Theory

Logits Intermediate

Raw model outputs before converting to probabilities; manipulated during decoding and calibration.

Foundations & Theory

Agent Intermediate

A system that perceives state, selects actions, and pursues goals—often combining LLM reasoning with tools and memory.

Agents & Autonomy

Orchestration Intermediate

Coordinating tools, models, and steps (retrieval, calls, validation) to deliver reliable end-to-end behavior.

Foundations & Theory

Key-Value Cache Intermediate

Stores past attention states to speed up autoregressive decoding.

AI Economics & Strategy

Rotary Positional Embeddings Intermediate

Encodes positional information via rotation in embedding space.

AI Economics & Strategy

Context Compression Intermediate

Techniques to handle longer documents without quadratic cost.

AI Economics & Strategy

Results for "intra-sequence relations"

Welcome to AI Glossary

Search

Browse

3D WordGraph