Results for "self-reinforcement"

AdvertisementAd space — search-top

94 results

Self-Model Frontier

Internal representation of the agent itself.

AGI & General Intelligence
Self-Attention Intermediate

Attention where queries/keys/values come from the same sequence, enabling token-to-token interactions.

Transformers & LLMs
Transformer Intermediate

Architecture based on self-attention and feedforward layers; foundation of modern LLMs and many multimodal models.

Transformers & LLMs
Self-Reflection Intermediate

Models evaluating and improving their own outputs.

AI Economics & Strategy
Self-Consistency Intro

Sampling multiple outputs and selecting consensus.

Prompting & Instructions
Reflection Prompting Intro

Asking model to review and improve output.

Prompting & Instructions
Self-Supervised Learning Intermediate

Learning from data by constructing “pseudo-labels” (e.g., next-token prediction, masked modeling) without manual annotation.

Machine Learning
Safety Filter Intermediate

Automated detection/prevention of disallowed outputs (toxicity, self-harm, illegal instruction, etc.).

Foundations & Theory
Vision Transformer Intermediate

Transformer applied to image patches.

Computer Vision
Feedback Loop Collapse Intermediate

Model trained on its own outputs degrades quality.

Model Failure Modes
Inverse Reinforcement Learning Advanced

Inferring reward function from observed behavior.

Reinforcement Learning
RLHF Intermediate

Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.

Optimization
Policy Intermediate

Strategy mapping states to actions.

AI Economics & Strategy
Imitation Learning Advanced

Learning policies from expert demonstrations.

Reinforcement Learning
Semi-Supervised Learning Intermediate

Training with a small labeled dataset plus a larger unlabeled dataset, leveraging assumptions like smoothness/cluster structure.

Machine Learning
Positional Encoding Intermediate

Injects sequence order into Transformers, since attention alone is permutation-invariant.

Foundations & Theory
Recurrent Neural Network Intermediate

Networks with recurrent connections for sequences; largely supplanted by Transformers for many tasks.

Neural Networks
Autoregressive Model Intermediate

Generates sequences one token at a time, conditioning on past tokens.

Foundations & Theory
Large Language Model Intermediate

A high-capacity language model trained on massive corpora, exhibiting broad generalization and emergent behaviors.

Large Language Models
Context Window Intermediate

Maximum number of tokens the model can attend to in one forward pass; constrains long-document reasoning.

Transformers & LLMs
Causal Mask Intermediate

Prevents attention to future tokens during training/inference.

AI Economics & Strategy
Emergence Advanced

System-level behavior arising from interactions.

Dynamics & Physics
Graph Attention Network Intermediate

GNN using attention to weight neighbor contributions dynamically.

Model Architectures
Swarm Dynamics Advanced

Collective behavior without central control.

Dynamics & Physics
Swarm Intelligence Advanced

Distributed agents producing emergent intelligence.

Agents & Autonomy
Meta-Cognition Frontier

Awareness and regulation of internal processes.

AGI & General Intelligence
Shutdown Problem Advanced

Ensuring AI allows shutdown.

AI Safety & Alignment
Instrumental Goals Advanced

Goals useful regardless of final objective.

AI Safety & Alignment
Reinforcement Learning Intermediate

A learning paradigm where an agent interacts with an environment and learns to choose actions to maximize cumulative reward.

Reinforcement Learning
State Space Intermediate

All possible configurations an agent may encounter.

AI Economics & Strategy

Welcome to AI Glossary

The free, self-building AI dictionary. Help us keep it free—click an ad once in a while!

Search

Type any question or keyword into the search bar at the top.

Browse

Tap a letter in the A–Z bar to browse terms alphabetically, or filter by domain, industry, or difficulty level.

3D WordGraph

Fly around the interactive 3D graph to explore how AI concepts connect. Click any word to read its full definition.