Results for "self-reinforcement"
Power-Seeking Behavior
Advanced
Tendency of an AI system to acquire control or resources, often as an instrumental means of achieving its goals.
Value Learning
Intermediate
Inferring human preferences from behavior or feedback and aligning a system's objectives with them.
Agent
Intermediate
A system that perceives state, selects actions, and pursues goals—often combining LLM reasoning with tools and memory.
On-Policy Learning
Intermediate
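Learning only from data generated by the current policy, in contrast to off-policy learning, which can reuse data collected by other policies.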
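To make the on-policy idea concrete, here is a minimal sketch of a SARSA-style update, a standard on-policy method. The function names, the dictionary-based Q-table, and the default values for alpha and gamma are illustrative assumptions, not part of this glossary. The key point is that the update bootstraps from the next action the current policy actually chose, rather than from the best available action.

```python
import random


def epsilon_greedy(q, state, n_actions, epsilon, rng):
    """Pick an action from the current policy (hypothetical helper)."""
    if rng.random() < epsilon:
        return rng.randrange(n_actions)  # explore
    values = [q.get((state, a), 0.0) for a in range(n_actions)]
    return values.index(max(values))  # exploit: first action with the highest value


def sarsa_update(q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """On-policy TD update: the target uses a_next, the action the policy took."""
    target = r + gamma * q.get((s_next, a_next), 0.0)
    old = q.get((s, a), 0.0)
    q[(s, a)] = old + alpha * (target - old)
    return q[(s, a)]


if __name__ == "__main__":
    rng = random.Random(0)
    q = {}
    a = epsilon_greedy(q, 0, 2, 0.1, rng)
    a_next = epsilon_greedy(q, 1, 2, 0.1, rng)
    # Assumed transition: state 0 -> state 1 with reward 1.0.
    print(sarsa_update(q, 0, a, 1.0, 1, a_next))
```

An off-policy method such as Q-learning would instead use the maximum value over all next actions in the target, regardless of which action the behavior policy selected.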