Search: proxy exploitation

Exploration-Exploitation Tradeoff Intermediate

Balancing learning new behaviors vs exploiting known rewards.

AI Economics & Strategy

Deceptive Alignment Advanced

Model behaves well during training but not deployment.

AI Safety & Alignment

Beam Search Intermediate

Search algorithm for generation that keeps top-k partial sequences; can improve likelihood but reduce diversity.

Foundations & Theory

Autonomous Agent Advanced

System that independently pursues goals over time.

Agents & Autonomy

Scalable Oversight Advanced

Using limited human feedback to guide large models.

AI Safety & Alignment

Model-Based RL Advanced

RL using learned or known environment models.

Reinforcement Learning

Surrogate Model Advanced

Fast approximation of costly simulations.

AI in Science

Results for "proxy exploitation"