Results for "recursive value"
Bellman Equation
Intermediate
Fundamental recursive relationship defining optimal value functions.
Actor-Critic
Intermediate
Combines value estimation (critic) with policy learning (actor).
Expectation
Advanced
Probability-weighted average of a random variable under a distribution.
Law of Large Numbers
Advanced
Sample mean converges to the expected value as the sample size grows.
Key-Value Cache
Intermediate
Stores attention keys and values from earlier tokens to speed up autoregressive decoding.
Value Function
Intermediate
Expected cumulative reward from a state or state-action pair.
Singular Value Decomposition
Advanced
Factors a matrix into two orthogonal matrices and a diagonal matrix of singular values; used in embeddings and compression.
Value Misalignment
Advanced
A model optimizes objectives that diverge from human values.
Value at Risk
Intermediate
Loss threshold not expected to be exceeded over a given time horizon at a given confidence level.
Value Learning
Intermediate
Inferring and aligning with human preferences.