Results for "query-key-value"
Tracking where data came from and how it was transformed; key for debugging and compliance.
Stochastic generation strategies that trade determinism for diversity; key knobs include temperature and nucleus sampling.
Stores past attention states to speed up autoregressive decoding.
Fundamental recursive relationship defining optimal value functions.
Combines value estimation (critic) with policy learning (actor).
Average value under a distribution.
Sample mean converges to expected value.
Expected cumulative reward from a state or state-action pair.
Decomposes a matrix into orthogonal components; used in embeddings and compression.
Model optimizes objectives misaligned with human values.
Maximum expected loss under normal conditions.
Inferring and aligning with human preferences.