Results for "parallel attention"
A single attention mechanism within multi-head attention.
Attention mechanisms that reduce quadratic complexity.
Architecture based on self-attention and feedforward layers; foundation of modern LLMs and many multimodal models.
Mechanism that computes context-aware mixtures of representations; scales well and captures long-range dependencies.
Attention where queries/keys/values come from the same sequence, enabling token-to-token interactions.
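The self-attention entry above describes queries, keys, and values all coming from one sequence; a minimal pure-Python sketch of scaled dot-product self-attention (the weight matrices `Wq`/`Wk`/`Wv` and helper names are illustrative assumptions, not from the source):

```python
import math

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def self_attention(X, Wq, Wk, Wv):
    """Scaled dot-product self-attention over one sequence.
    X: list of token vectors; Wq/Wk/Wv: weight matrices as lists of rows."""
    def matvec(W, x):  # W @ x
        return [sum(w_i * x_i for w_i, x_i in zip(row, x)) for row in W]
    Q = [matvec(Wq, x) for x in X]
    K = [matvec(Wk, x) for x in X]
    V = [matvec(Wv, x) for x in X]
    d = len(Q[0])
    out = []
    for q in Q:  # every token's query scores every token's key
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in K]
        w = softmax(scores)
        out.append([sum(wi * v[j] for wi, v in zip(w, V))
                    for j in range(len(V[0]))])
    return out
```

With identity weight matrices, each output row is a convex mixture of the input token vectors, which is the "context-aware mixture" idea in the mechanism entry above.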
GNN using attention to weight neighbor contributions dynamically.
Attention between different modalities.
Allows the model to attend to information from different representation subspaces simultaneously.
Prevents attention to future tokens during training/inference.
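The masking entry above is usually implemented by setting future positions' scores to negative infinity before the softmax; a minimal sketch (names are illustrative assumptions):

```python
import math

def causal_softmax(scores, i):
    """Softmax over attention scores for query position i,
    masking future positions (j > i) with -inf so they get zero weight."""
    masked = [s if j <= i else float("-inf") for j, s in enumerate(scores)]
    m = max(masked[: i + 1])  # max over the visible prefix
    es = [math.exp(s - m) if s != float("-inf") else 0.0 for s in masked]
    tot = sum(es)
    return [e / tot for e in es]
```

Position 0 can only attend to itself; position i sees tokens 0..i, which is what makes autoregressive training and decoding consistent.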
Techniques to handle longer documents without quadratic cost.
Ability to replicate results given the same code and data; harder with distributed training and nondeterministic ops.
How many requests or tokens can be processed per unit time; affects scalability and cost.
Hardware resources used for training/inference; constrained by memory bandwidth, FLOPs, and parallelism.
Running a new model alongside production traffic without user impact.
Running predictions on large datasets periodically.
Injects sequence order into Transformers, since attention alone is permutation-invariant.
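The positional-encoding entry above is commonly realized with the sinusoidal scheme from the original Transformer; a short sketch (assuming that sinusoidal variant, which the entry does not itself specify):

```python
import math

def sinusoidal_pe(pos, d_model):
    """Sinusoidal positional encoding: sin on even dims, cos on odd dims,
    with wavelengths forming a geometric progression up to 10000."""
    pe = []
    for i in range(0, d_model, 2):
        angle = pos / (10000 ** (i / d_model))
        pe.append(math.sin(angle))
        if i + 1 < d_model:
            pe.append(math.cos(angle))
    return pe
```

Adding this vector to each token embedding gives attention a way to distinguish positions, since attention itself is permutation-invariant.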
Studying internal mechanisms or input influence on outputs (e.g., saliency maps, SHAP, attention analysis).
Stores past key/value states to speed up autoregressive decoding.
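The caching entry above can be sketched as a toy class that appends each new token's key/value pair instead of recomputing all past ones on every decoding step (class and method names are illustrative assumptions):

```python
import math

class KVCache:
    """Toy key/value cache for autoregressive decoding: keys/values for
    past tokens are stored once and reused by every later query."""
    def __init__(self):
        self.keys, self.values = [], []

    def append(self, k, v):
        # one new token per decoding step
        self.keys.append(k)
        self.values.append(v)

    def attend(self, q):
        # dot-product scores against all cached keys, softmax-weighted values
        scores = [sum(qi * ki for qi, ki in zip(q, k)) for k in self.keys]
        m = max(scores)
        es = [math.exp(s - m) for s in scores]
        tot = sum(es)
        w = [e / tot for e in es]
        return [sum(wi * v[j] for wi, v in zip(w, self.values))
                for j in range(len(self.values[0]))]
```

This turns per-step attention cost from quadratic recomputation into a single query against the cache, which is why it matters for decoding throughput.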
Transformer applied to image patches.
Using delimiter markers to isolate context segments.
Networks with recurrent connections for sequences; largely supplanted by Transformers for many tasks.
Generates sequences one token at a time, conditioning on past tokens.
A high-capacity language model trained on massive corpora, exhibiting broad generalization and emergent behaviors.
Maximum number of tokens the model can attend to in one forward pass; constrains long-document reasoning.
AI subfield dealing with understanding and generating human language, including syntax, semantics, and pragmatics.
Extending agents with long-term memory stores.
Converting audio speech into text, often using encoder-decoder or transducer architectures.
Extracting system prompts or hidden instructions.
Models trained to decide when to call tools.
Maximizing the reward signal without fulfilling the real goal.