Results for "aligned agents"
Mixed-motive interaction: a combination of cooperation and competition.
Multi-agent system: multiple agents interacting cooperatively or competitively.
Swarm intelligence: distributed agents producing emergent intelligence.
Coordination failure: agents fail to coordinate on an optimal outcome.
Mechanism design: designing systems where rational agents behave as desired.
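A classic mechanism-design example is the second-price (Vickrey) auction: the highest bidder wins but pays the second-highest bid, which makes truthful bidding a dominant strategy for rational agents. A minimal sketch (function and agent names are illustrative, not from the source):

```python
def vickrey_auction(bids):
    """bids: dict mapping agent name -> bid. Returns (winner, price paid)."""
    ranked = sorted(bids.items(), key=lambda kv: kv[1], reverse=True)
    winner = ranked[0][0]
    # Winner pays the second-highest bid, not their own bid.
    price = ranked[1][1] if len(ranked) > 1 else 0
    return winner, price

winner, price = vickrey_auction({"alice": 10, "bob": 7, "carol": 4})
# winner == "alice", price == 7
```

Because the price is set by others' bids, no agent can gain by misreporting its valuation; the mechanism makes honest behavior the rational choice.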
Model drift: a shift in model outputs over time.
AI alignment: ensuring AI systems pursue intended human goals.
Misalignment: the model optimizes objectives misaligned with human values.
Outer alignment: correctly specifying the goals given to a system.
Mesa-optimization: a learned subsystem that optimizes its own objective.
Deceptive alignment: the model behaves well during training but not deployment.
Corrigibility: the willingness of a system to accept correction or shutdown.
Model governance: oversight and control of model changes.
Warning signs: signals indicating dangerous behavior.
On-policy learning: learning only from data generated by the current policy.
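The on-policy constraint can be shown with a two-armed bandit sketch: value estimates are updated only from actions the current (epsilon-greedy) policy itself selects, never from a replay of other policies' data. The environment and constants here are illustrative:

```python
import random

random.seed(0)
values = {"a": 0.0, "b": 0.0}   # running value estimates per action
counts = {"a": 0, "b": 0}
REWARD = {"a": 1.0, "b": 0.0}   # hypothetical environment payoffs

def policy(eps=0.1):
    """Epsilon-greedy: mostly exploit the current estimates, sometimes explore."""
    if random.random() < eps:
        return random.choice(list(values))
    return max(values, key=values.get)

for _ in range(100):
    act = policy()              # data comes from the current policy...
    counts[act] += 1
    # ...and the update uses exactly that freshly collected experience.
    values[act] += (REWARD[act] - values[act]) / counts[act]
```

After training, the estimate for the rewarding arm dominates; the key property is that every update traces back to an action the live policy chose.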
Memory augmentation: extending agents with long-term memory stores.
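A minimal sketch of such a memory store, assuming a naive keyword-overlap retriever (a real system would typically use embeddings; all names here are hypothetical):

```python
memory_store = []

def remember(note):
    """Append an observation to the agent's long-term store."""
    memory_store.append(note)

def recall(query, k=2):
    """Return the k stored notes sharing the most words with the query."""
    words = set(query.lower().split())
    scored = sorted(memory_store,
                    key=lambda n: len(words & set(n.lower().split())),
                    reverse=True)
    return scored[:k]

remember("user prefers metric units")
remember("project deadline is Friday")
remember("user speaks French")

print(recall("which units does the user prefer?"))
```

The agent writes across sessions and reads back only what is relevant to the current query, rather than carrying its whole history in context.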
Autonomous agent: a system that independently pursues goals over time.
Task decomposition: breaking goals down into manageable sub-tasks.
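Decomposition can be sketched recursively: a goal is either primitive (directly executable) or expands into sub-tasks that are planned in turn. The decomposition table below is hypothetical:

```python
# Hypothetical decomposition table: goal -> ordered sub-tasks.
DECOMPOSITION = {
    "write report": ["gather data", "draft text", "review draft"],
    "gather data": ["query database"],
}

def plan(goal):
    """Flatten a goal into an ordered list of primitive actions."""
    subtasks = DECOMPOSITION.get(goal)
    if subtasks is None:        # primitive task: execute directly
        return [goal]
    steps = []
    for sub in subtasks:
        steps.extend(plan(sub))
    return steps

print(plan("write report"))
# ['query database', 'draft text', 'review draft']
```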
Reactive agent: a simple agent responding directly to inputs.
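A reactive agent is essentially a fixed mapping from percept to action, with no internal state or planning. A minimal sketch with a hypothetical rule table:

```python
# Percept -> action rules; no memory, no lookahead.
RULES = {"obstacle": "turn", "clear": "forward", "goal": "stop"}

def reactive_agent(percept):
    """Respond directly to the current input; fall back to a safe default."""
    return RULES.get(percept, "wait")

print(reactive_agent("obstacle"))  # turn
```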
Blackboard architecture: agents communicate via shared state.
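In this style of coordination, agents never call each other directly; each reads the shared state, contributes when its precondition is met, and writes back. A sketch with hypothetical agent roles:

```python
# Shared state ("blackboard"): the only communication channel.
blackboard = {"task": "translate 'bonjour'", "draft": None, "final": None}

def translator(bb):
    if bb["task"] and bb["draft"] is None:
        bb["draft"] = "hello"            # writes its contribution

def reviewer(bb):
    if bb["draft"] and bb["final"] is None:
        bb["final"] = bb["draft"].capitalize()

for agent in (translator, reviewer):     # each agent only touches shared state
    agent(blackboard)

print(blackboard["final"])  # Hello
```

Because coupling runs through the shared state rather than direct messages, agents can be added or removed without rewiring the others.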
Instrumental convergence: the tendency for agents to pursue instrumental subgoals such as resource acquisition regardless of their final goal.
Emergent competition: competition that arises without explicit design.
Algorithmic collusion: AI systems tacitly coordinating prices.
Information asymmetry: some agents know more than others.
Cooperative AI: designing AI to cooperate with humans and with other AI systems.
Simulation environment: an artificial environment for training and testing agents.
Norm emergence: the emergence of conventions among agents.
AI agent: a system that perceives state, selects actions, and pursues goals, often combining LLM reasoning with tools and memory.
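The perceive-select-act cycle in that definition can be sketched as a minimal loop; the policy stub below stands in for LLM reasoning, and `apply` stands in for tool use, so all components are hypothetical:

```python
def apply(env, action):
    """Stand-in for a tool call / environment step."""
    env["state"] = "goal" if action == "finish" else "working"
    return env["state"]

def policy(state, memory):
    """Stand-in for LLM reasoning over the current state and memory."""
    return "finish" if len(memory) >= 2 else "work"

def agent_loop(env, policy, max_steps=5):
    memory = []
    state = env["state"]
    for _ in range(max_steps):
        action = policy(state, memory)   # select an action
        state = apply(env, action)       # act on the environment
        memory.append((action, state))   # record the step in memory
        if state == "goal":              # goal reached: stop pursuing
            break
    return memory

trace = agent_loop({"state": "start"}, policy)
```

The loop terminates either when the goal state is reached or after a step budget, a common safeguard against unbounded autonomous runs.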
Observability: the ability to infer a system's internal state from its telemetry, crucial for monitoring AI services and agents.
Emergent coordination: coordination arising without explicit programming.