Results for "attention"

Attention

Intermediate

Mechanism that computes context-aware mixtures of representations; scales well and captures long-range dependencies.

Attention is like a spotlight that helps a model focus on the most important parts of the input data when making predictions. For example, when translating a sentence, attention allows the model to pay more attention to certain words that are crucial for understanding the meaning. Instead of trea...

AdvertisementAd space — search-top

31 results

Attention Head Intermediate

A single attention mechanism within multi-head attention.

AI Economics & Strategy
Sparse Attention Intermediate

Attention mechanisms that reduce quadratic complexity.

AI Economics & Strategy
Attention Intermediate

Mechanism that computes context-aware mixtures of representations; scales well and captures long-range dependencies.

Transformers & LLMs
Self-Attention Intermediate

Attention where queries/keys/values come from the same sequence, enabling token-to-token interactions.

Transformers & LLMs
Graph Attention Network Intermediate

GNN using attention to weight neighbor contributions dynamically.

Model Architectures
Cross-Attention Intermediate

Attention between different modalities.

Computer Vision
Transformer Intermediate

Architecture based on self-attention and feedforward layers; foundation of modern LLMs and many multimodal models.

Transformers & LLMs
Multi-Head Attention Intermediate

Allows model to attend to information from different subspaces simultaneously.

AI Economics & Strategy
Causal Mask Intermediate

Prevents attention to future tokens during training/inference.

AI Economics & Strategy
Context Compression Intermediate

Techniques to handle longer documents without quadratic cost.

AI Economics & Strategy
Positional Encoding Intermediate

Injects sequence order into Transformers, since attention alone is permutation-invariant.

Foundations & Theory
Interpretability Intermediate

Studying internal mechanisms or input influence on outputs (e.g., saliency maps, SHAP, attention analysis).

Foundations & Theory
Key-Value Cache Intermediate

Stores past attention states to speed up autoregressive decoding.

AI Economics & Strategy
Vision Transformer Intermediate

Transformer applied to image patches.

Computer Vision
Delimited Prompt Intro

Using markers to isolate context segments.

Prompting & Instructions
Autoregressive Model Intermediate

Generates sequences one token at a time, conditioning on past tokens.

Foundations & Theory
Recurrent Neural Network Intermediate

Networks with recurrent connections for sequences; largely supplanted by Transformers for many tasks.

Neural Networks
Context Window Intermediate

Maximum number of tokens the model can attend to in one forward pass; constrains long-document reasoning.

Transformers & LLMs
Large Language Model Intermediate

A high-capacity language model trained on massive corpora, exhibiting broad generalization and emergent behaviors.

Large Language Models
Memory Augmentation Intermediate

Extending agents with long-term memory stores.

AI Economics & Strategy
NLP Intermediate

AI subfield dealing with understanding and generating human language, including syntax, semantics, and pragmatics.

Foundations & Theory
Prompt Leakage Intermediate

Extracting system prompts or hidden instructions.

AI Economics & Strategy
Speech Recognition Intermediate

Converting audio speech into text, often using encoder-decoder or transducer architectures.

Speech & Audio AI
Reward Hacking Advanced

Maximizing reward without fulfilling real goal.

AI Safety & Alignment
Toolformer Intermediate

Models trained to decide when to call tools.

AI Economics & Strategy
Role Prompting Intro

Assigning a role or identity to the model.

Prompting & Instructions
Decomposition Prompt Intro

Breaking tasks into sub-steps.

Prompting & Instructions
Scratchpad Intro

Temporary reasoning space (often hidden).

Prompting & Instructions
Prompt Sensitivity Intermediate

Small prompt changes cause large output changes.

Model Failure Modes
Natural Language Instruction Frontier

Controlling robots via language.

World Models & Cognition

Welcome to AI Glossary

The free, self-building AI dictionary. Help us keep it free—click an ad once in a while!

Search

Type any question or keyword into the search bar at the top.

Browse

Tap a letter in the A–Z bar to browse terms alphabetically, or filter by domain, industry, or difficulty level.

3D WordGraph

Fly around the interactive 3D graph to explore how AI concepts connect. Click any word to read its full definition.