Results for "aligned agents"

75 results

Mixed-Motive Game Advanced

A game in which agents' incentives combine cooperation and competition.

Agents & Autonomy
Multi-Agent System Intermediate

Multiple agents interacting cooperatively or competitively.

AI Economics & Strategy
Swarm Intelligence Advanced

Distributed agents producing emergent intelligence.

Agents & Autonomy
Coordination Failure Advanced

Agents fail to coordinate optimally.

Agents & Autonomy
Mechanism Design Advanced

Designing systems where rational agents behave as desired.

Agents & Autonomy
Prediction Drift Intermediate

Shift in the distribution of a model's outputs over time.

MLOps & Infrastructure
Alignment Problem Advanced

Ensuring AI systems pursue intended human goals.

AI Safety & Alignment
Value Misalignment Advanced

Model optimizes objectives misaligned with human values.

AI Safety & Alignment
Outer Alignment Advanced

Specifying a training objective that correctly captures the intended goals.

AI Safety & Alignment
Mesa-Optimizer Advanced

Learned subsystem that optimizes its own objective.

AI Safety & Alignment
Deceptive Alignment Advanced

Model appears aligned during training but pursues different objectives in deployment.

AI Safety & Alignment
Corrigibility Advanced

Willingness of a system to accept correction or shutdown.

AI Safety & Alignment
Change Management Intermediate

Governance of model changes.

Governance & Ethics
Tripwire Advanced

Signals indicating dangerous behavior.

AI Safety & Alignment
On-Policy Learning Intermediate

Learning only from data generated by the current policy.

AI Economics & Strategy
Memory Augmentation Intermediate

Extending agents with long-term memory stores.

AI Economics & Strategy
Autonomous Agent Advanced

System that independently pursues goals over time.

Agents & Autonomy
Hierarchical Planning Advanced

Decomposing goals into sub-tasks.

Agents & Autonomy
Reflex Agent Advanced

Simple agent responding directly to inputs.

Agents & Autonomy
Blackboard System Advanced

Agents communicate via shared state.

Agents & Autonomy
Instrumental Convergence Advanced

Tendency for agents to pursue similar subgoals, such as acquiring resources, regardless of their final goal.

AI Safety & Alignment
Emergent Competition Advanced

Competition arises without explicit design.

Agents & Autonomy
Algorithmic Collusion Advanced

AI pricing systems tacitly coordinating prices without explicit agreement.

Agents & Autonomy
Information Asymmetry Advanced

Some agents know more than others.

Agents & Autonomy
Cooperative AI Intermediate

Designing AI to cooperate with humans and each other.

Governance & Ethics
Simulation Advanced

Artificial environment for training/testing agents.

Simulation & Sim-to-Real
Norm Formation Advanced

Emergence of conventions among agents.

Dynamics & Physics
Agent Intermediate

A system that perceives state, selects actions, and pursues goals—often combining LLM reasoning with tools and memory.

Agents & Autonomy
Observability Intermediate

The capability to infer internal system state from telemetry, which is crucial for AI services and agents.

Evaluation & Benchmarking
Emergent Coordination Intermediate

Coordination arising without explicit programming.

AI Economics & Strategy
