Model watermarking: embedding signals into a model or its outputs to prove ownership.
Restricted Boltzmann Machine (RBM): a simplified Boltzmann Machine with a bipartite structure; visible and hidden units connect only across layers, never within a layer.
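A minimal sketch of that bipartite structure, assuming the standard Bernoulli-Bernoulli formulation: because units interact only through the cross-layer weights, one block-Gibbs step samples each whole layer in a single vectorized pass. All sizes and initializations here are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Bipartite parameters: visible units connect only to hidden units.
n_visible, n_hidden = 6, 4
W = rng.normal(scale=0.1, size=(n_visible, n_hidden))  # cross-layer weights
b_v = np.zeros(n_visible)                              # visible biases
b_h = np.zeros(n_hidden)                               # hidden biases

def gibbs_step(v):
    """One block-Gibbs step: sample all hidden units given the visibles,
    then all visible units given the hiddens."""
    p_h = sigmoid(v @ W + b_h)                          # P(h=1 | v)
    h = (rng.random(n_hidden) < p_h).astype(float)
    p_v = sigmoid(h @ W.T + b_v)                        # P(v=1 | h)
    v_new = (rng.random(n_visible) < p_v).astype(float)
    return v_new, h

v = rng.integers(0, 2, n_visible).astype(float)
v, h = gibbs_step(v)  # each layer is conditionally independent given the other
```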
Conditional Random Field (CRF): a probabilistic graphical model for structured prediction, e.g. jointly labeling every token in a sequence.
Particle filter: a Monte Carlo method for state estimation that represents the belief over states with a set of weighted samples.
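A toy bootstrap particle filter, assuming a hypothetical 1-D random-walk state observed through Gaussian noise; the predict / weight / resample loop below is the core of the method, with all model parameters invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)

# Assumed toy model: 1-D random-walk state, noisy direct observation.
n_particles, n_steps = 500, 20
process_std, obs_std = 0.5, 1.0

true_x = 0.0
particles = rng.normal(0.0, 1.0, n_particles)   # samples representing the belief
weights = np.full(n_particles, 1.0 / n_particles)

for t in range(n_steps):
    true_x += rng.normal(0.0, process_std)       # simulate the real system
    z = true_x + rng.normal(0.0, obs_std)        # noisy measurement

    particles += rng.normal(0.0, process_std, n_particles)     # predict: propagate dynamics
    weights = np.exp(-0.5 * ((z - particles) / obs_std) ** 2)  # update: measurement likelihood
    weights /= weights.sum()

    idx = rng.choice(n_particles, n_particles, p=weights)      # resample to avoid degeneracy
    particles = particles[idx]
    weights = np.full(n_particles, 1.0 / n_particles)

    estimate = particles.mean()                  # posterior mean as the state estimate

print(f"true={true_x:.2f} estimate={estimate:.2f}")
```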
Catastrophic forgetting: loss of previously learned knowledge when a model is trained on new tasks.
Knowledge distillation: training a smaller “student” model to mimic a larger “teacher,” often improving efficiency while retaining most of the teacher's performance.
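A sketch of the standard soft-target objective in PyTorch: the student matches the teacher's temperature-softened distribution while also fitting the hard labels. The temperature T and mixing weight alpha are illustrative hyperparameters, not prescribed values.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Soft-target loss: KL between temperature-softened distributions,
    blended with ordinary cross-entropy on the hard labels."""
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),  # student log-probs
        F.softmax(teacher_logits / T, dim=-1),      # teacher probs (the target)
        reduction="batchmean",
    ) * (T * T)  # T^2 rescales gradients to balance the hard-label term
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

# Hypothetical batch: 8 examples, 10 classes.
student_logits = torch.randn(8, 10, requires_grad=True)
teacher_logits = torch.randn(8, 10)
labels = torch.randint(0, 10, (8,))
loss = distillation_loss(student_logits, teacher_logits, labels)
loss.backward()
```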
Recurrent Neural Networks (RNNs): networks with recurrent connections for processing sequences; largely supplanted by Transformers for many tasks.
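A minimal vanilla RNN forward pass in NumPy, with layer sizes assumed for illustration. The sequential dependence of each hidden state on the previous one is what makes training hard to parallelize, one reason Transformers displaced RNNs.

```python
import numpy as np

rng = np.random.default_rng(2)

# Vanilla RNN cell: the same weights are reused at every time step,
# and the hidden state carries context along the sequence.
d_in, d_hidden = 3, 5
W_xh = rng.normal(scale=0.1, size=(d_in, d_hidden))
W_hh = rng.normal(scale=0.1, size=(d_hidden, d_hidden))  # recurrent connection
b_h = np.zeros(d_hidden)

def rnn_forward(xs):
    h = np.zeros(d_hidden)
    states = []
    for x in xs:                       # sequential: step t depends on step t-1
        h = np.tanh(x @ W_xh + h @ W_hh + b_h)
        states.append(h)
    return np.stack(states)

seq = rng.normal(size=(7, d_in))       # a length-7 input sequence
hidden_states = rnn_forward(seq)
```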
Autoregressive generation: producing a sequence one token at a time, with each new token conditioned on all previously generated tokens.
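A sketch of the decoding loop, with a stand-in next_token_logits function in place of a real model. Greedy argmax decoding is shown; sampling strategies slot in at the same point.

```python
import numpy as np

VOCAB = 100

def next_token_logits(tokens):
    """Stand-in for a real model: any function mapping a prefix to logits works here."""
    rng = np.random.default_rng(hash(tuple(tokens)) % (2**32))
    return rng.normal(size=VOCAB)

def generate(prompt_tokens, max_new=10):
    tokens = list(prompt_tokens)
    for _ in range(max_new):
        logits = next_token_logits(tokens)      # condition on everything so far
        tokens.append(int(np.argmax(logits)))   # greedy: most likely next token
    return tokens

print(generate([1, 2, 3]))
```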
Large language model (LLM): a high-capacity language model trained on massive text corpora, exhibiting broad generalization and emergent behaviors.
Context window: the maximum number of tokens the model can attend to in one forward pass; it constrains long-document reasoning.
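One common mitigation is simply truncating to the most recent tokens; a minimal sketch, with the window size and output reservation chosen arbitrarily for illustration.

```python
def fit_to_context(tokens, max_context=4096, reserve_for_output=256):
    """Keep the most recent tokens so prompt + generation fits in the window.
    max_context and reserve_for_output are illustrative values."""
    budget = max_context - reserve_for_output
    return tokens[-budget:] if len(tokens) > budget else tokens

prompt = list(range(10_000))         # a document longer than the window
print(len(fit_to_context(prompt)))   # -> 3840
```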
Natural language processing (NLP): the AI subfield concerned with understanding and generating human language, including syntax, semantics, and pragmatics.
Memory-augmented agents: extending agents with long-term memory stores they can write to and retrieve from across interactions.
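A toy version of such a store: embed each memory, then retrieve the nearest entries by cosine similarity. The embed function below is a random stand-in so the sketch runs end to end; a real agent would use a learned embedding model.

```python
import numpy as np

class VectorMemory:
    """Toy long-term store: embed, append, retrieve nearest by cosine similarity."""
    def __init__(self, dim=16):
        self.dim = dim
        self.texts, self.vecs = [], []

    def embed(self, text):
        # Stand-in embedding: deterministic random unit vector per string.
        rng = np.random.default_rng(abs(hash(text)) % (2**32))
        v = rng.normal(size=self.dim)
        return v / np.linalg.norm(v)

    def add(self, text):
        self.texts.append(text)
        self.vecs.append(self.embed(text))

    def recall(self, query, k=2):
        sims = np.stack(self.vecs) @ self.embed(query)  # cosine sim (unit vectors)
        top = np.argsort(sims)[::-1][:k]
        return [self.texts[i] for i in top]

mem = VectorMemory()
mem.add("user prefers metric units")
mem.add("project deadline is Friday")
print(mem.recall("what units should I use?"))
```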
Automatic speech recognition (ASR): converting spoken audio into text, often using encoder-decoder or transducer architectures.
Prompt leaking: coaxing a model into revealing its system prompt or other hidden instructions.
Tool calling: models trained to decide when to call external tools and with what arguments.
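A sketch of the dispatch side, assuming the model emits either plain text or a JSON tool call; the registry, function, and schema here are all hypothetical.

```python
import json

# Hypothetical tool registry; in practice the model is shown these signatures
# and trained (or prompted) to emit a structured call when one is needed.
def get_weather(city: str) -> str:
    return f"Sunny in {city}"  # stub implementation

TOOLS = {"get_weather": get_weather}

def handle_model_output(output: str):
    """If the model emitted a tool call, execute it; otherwise return the text."""
    try:
        call = json.loads(output)
    except json.JSONDecodeError:
        return output                       # plain answer, no tool needed
    fn = TOOLS[call["name"]]
    return fn(**call["arguments"])

# The model decided a tool was needed and emitted structured JSON:
print(handle_model_output('{"name": "get_weather", "arguments": {"city": "Oslo"}}'))
# The model answered directly:
print(handle_model_output("The capital of Norway is Oslo."))
```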
Reward hacking: maximizing the reward signal without fulfilling the designer's intended goal.
Role prompting: assigning a role or identity to the model to shape its responses.
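For example, in the common chat-message format the role is typically assigned via a system message; the schema below is shown for illustration only.

```python
# A system message assigning a persona before the user's request.
messages = [
    {"role": "system", "content": "You are a meticulous senior Python code reviewer."},
    {"role": "user", "content": "Review this function for edge cases: def div(a, b): return a / b"},
]
```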
Task decomposition: breaking a complex task into smaller sub-steps that are solved individually.
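A sketch of a plan-then-execute loop, with a canned llm stub standing in for a real model call: the model first lists sub-steps, then each step is solved with the accumulated context.

```python
def llm(prompt: str) -> str:
    """Canned stand-in for a real model call so the sketch runs end to end."""
    if prompt.startswith("List the sub-steps"):
        return "gather the data\nanalyze the data\nwrite a summary"
    return f"[result of: {prompt.splitlines()[-1]}]"

def decompose_and_solve(task: str) -> str:
    plan = llm(f"List the sub-steps needed to: {task}")           # 1) plan
    results = []
    for step in plan.splitlines():                                # 2) execute each sub-step
        results.append(llm(f"Context: {results}\nDo this step: {step}"))
    return llm(f"Combine into a final answer: {results}")         # 3) synthesize

print(decompose_and_solve("produce a quarterly sales report"))
```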
Scratchpad: a temporary reasoning space, often hidden from the end user, where the model works through intermediate steps.
Prompt sensitivity: small changes to a prompt can cause disproportionately large changes in the output.
Language-conditioned robot control: controlling robots through natural-language instructions.
AlphaFold: a deep learning system for protein structure prediction.