Results for "goal divergence"
Reward Hacking
Advanced
Maximizing reward without fulfilling real goal.
Instrumental Convergence
Advanced
Tendency for agents to pursue resources regardless of final goal.
Path Planning
Advanced
Finding routes from start to goal.
Exploding Gradient
Intermediate
Gradients grow too large, causing divergence; mitigated by clipping, normalization, careful init.
Cross-Entropy
Intermediate
Measures divergence between true and predicted probability distributions.
Warmup
Intermediate
Gradually increasing learning rate at training start to avoid divergence.
KL Divergence
Intermediate
Measures how one probability distribution diverges from another.