Search: hidden objectives

Inner Alignment Advanced

Ensuring learned behavior matches intended objective.

AI Safety & Alignment

Robust Alignment Advanced

Maintaining alignment under new conditions.

AI Safety & Alignment

Deceptive Alignment Advanced

Model behaves well during training but not deployment.

AI Safety & Alignment

Change Management Intermediate

Governance of model changes.

Governance & Ethics

Model Orchestration Intermediate

Coordinating models, tools, and logic.

AI Economics & Strategy

Token Budgeting Intermediate

Limiting inference usage.

AI Economics & Strategy

Plant Intermediate

The physical system being controlled.

Foundations & Theory

Cooperative Game Advanced

Agents optimize collective outcomes.

Agents & Autonomy

Shutdown Problem Advanced

Ensuring AI allows shutdown.

AI Safety & Alignment

Power-Seeking Behavior Advanced

Tendency to gain control/resources.

AI Safety & Alignment

Orthogonality Thesis Advanced

Intelligence and goals are independent.

AI Safety & Alignment

Instrumental Goals Advanced

Goals useful regardless of final objective.

AI Safety & Alignment

Alignment Research Intermediate

Research ensuring AI remains safe.

Governance & Ethics

Cooperative AI Intermediate

Designing AI to cooperate with humans and each other.

Governance & Ethics

Existential Risk Advanced

Risk threatening humanity’s survival.

AI Safety & Alignment

Results for "hidden objectives"

Welcome to AI Glossary

Search

Browse

3D WordGraph