Results for "optimal tradeoff"
Balancing exploration of new behaviors against exploitation of known rewards.
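One common way to strike this balance is an epsilon-greedy rule on a multi-armed bandit. The sketch below is illustrative only: the function names, the Gaussian reward model, and the default `epsilon=0.1` are assumptions, not part of the entry above.

```python
import random

def epsilon_greedy(q_values, epsilon, rng):
    """Pick an arm index: explore with probability epsilon, otherwise exploit."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))  # explore: uniformly random arm
    # exploit: arm with the highest current estimate (first index wins ties)
    return max(range(len(q_values)), key=lambda a: q_values[a])

def run_bandit(true_means, epsilon=0.1, steps=2000, seed=0):
    """Simulate a bandit with unit-variance Gaussian rewards (an assumed model)."""
    rng = random.Random(seed)
    q = [0.0] * len(true_means)      # running reward estimate per arm
    counts = [0] * len(true_means)   # number of pulls per arm
    for _ in range(steps):
        a = epsilon_greedy(q, epsilon, rng)
        reward = rng.gauss(true_means[a], 1.0)
        counts[a] += 1
        q[a] += (reward - q[a]) / counts[a]  # incremental sample mean
    return q, counts
```

Setting `epsilon=0` gives pure exploitation and `epsilon=1` gives pure exploration; values in between trade one against the other.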
A conceptual framework decomposing expected prediction error into systematic error (squared bias), sensitivity to the training data (variance), and irreducible noise.
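For squared-error loss the decomposition can be written out explicitly. The notation is assumed here for illustration: data are generated as y = f(x) + ε with E[ε] = 0 and Var(ε) = σ², the fitted model is f̂, and expectations are taken over training sets and noise.

```latex
\mathbb{E}\big[(y - \hat f(x))^2\big]
  = \underbrace{\big(\mathbb{E}[\hat f(x)] - f(x)\big)^2}_{\text{bias}^2}
  + \underbrace{\mathbb{E}\big[(\hat f(x) - \mathbb{E}[\hat f(x)])^2\big]}_{\text{variance}}
  + \underbrace{\sigma^2}_{\text{irreducible noise}}
```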
Tradeoff between safety and performance.
Fundamental recursive relationship defining optimal value functions.
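Applying this recursive relationship repeatedly as an update rule gives value iteration. The following is a minimal sketch on a toy problem; the `transitions` dictionary layout, the state and action names, and the discount factor are all hypothetical choices made for the example.

```python
def value_iteration(transitions, gamma=0.9, tol=1e-8, max_iters=10_000):
    """Iterate V(s) <- max_a sum_s' p * (r + gamma * V(s')) until convergence.

    transitions[s][a] is a list of (probability, next_state, reward) tuples.
    """
    values = {s: 0.0 for s in transitions}
    for _ in range(max_iters):
        delta = 0.0
        for s, actions in transitions.items():
            # Bellman optimality backup for state s
            best = max(
                sum(p * (r + gamma * values[s2]) for p, s2, r in outcomes)
                for outcomes in actions.values()
            )
            delta = max(delta, abs(best - values[s]))
            values[s] = best  # in-place update
        if delta < tol:
            break
    return values

# Toy two-state problem (hypothetical): from A the agent can stay for reward 0
# or move to B for reward 1; B loops on itself with reward 2.
toy_mdp = {
    "A": {"stay": [(1.0, "A", 0.0)], "go": [(1.0, "B", 1.0)]},
    "B": {"stay": [(1.0, "B", 2.0)]},
}
optimal_values = value_iteration(toy_mdp, gamma=0.5)
```

With `gamma=0.5` the fixed point can be checked by hand: V(B) = 2 / (1 - 0.5) = 4 and V(A) = 1 + 0.5 * 4 = 3.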
Expected return from taking a given action in a given state and following the policy thereafter.
Finding control policies minimizing cumulative cost.
Optimal control for linear systems with quadratic cost.
Techniques that discourage overly complex solutions to improve generalization (reduce overfitting).
A robust evaluation technique that trains/evaluates across multiple splits to estimate performance variability.
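A k-fold scheme is one common instance of such a technique. The sketch below uses only the standard library and a mean-predictor baseline as a stand-in model; the function names and the contiguous, unshuffled fold layout are assumptions made for the example.

```python
import statistics

def k_fold_indices(n_samples, k):
    """Yield (train_idx, test_idx) pairs for k contiguous, near-equal folds."""
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = list(range(start, start + size))
        train_idx = list(range(0, start)) + list(range(start + size, n_samples))
        yield train_idx, test_idx
        start += size

def cross_validate(ys, k):
    """Return (mean, population std) of per-fold MSE for a mean-predictor baseline."""
    scores = []
    for train_idx, test_idx in k_fold_indices(len(ys), k):
        prediction = statistics.fmean(ys[i] for i in train_idx)  # "train"
        mse = statistics.fmean((ys[i] - prediction) ** 2 for i in test_idx)  # "evaluate"
        scores.append(mse)
    return statistics.fmean(scores), statistics.pstdev(scores)
```

The spread of the per-fold scores is the estimate of performance variability. In practice the data are usually shuffled before splitting, which this sketch omits.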
Error due to sensitivity to fluctuations in the training dataset.
Systematic error introduced by simplifying assumptions in a learning algorithm.
Scaling law optimizing compute vs data.
Optimal estimator for linear dynamic systems.
Alternative formulation providing bounds.
Optimizing continuous action sequences.
Optimal pathfinding algorithm.
Fast approximation of costly simulations.
When a model cannot capture underlying structure, performing poorly on both training and test data.
Number of samples per gradient update; impacts compute efficiency, generalization, and stability.
One complete traversal of the training dataset during training.
Halting training when validation performance stops improving to reduce overfitting.
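A patience counter is a typical way to decide when validation performance has "stopped improving". This is a minimal sketch operating on a pre-recorded list of validation losses; the function name and the `patience` and `min_delta` parameters are assumed for illustration.

```python
def early_stopping_epoch(val_losses, patience=3, min_delta=0.0):
    """Return (stop_epoch, best_epoch) for a sequence of per-epoch validation losses.

    Training stops once `patience` consecutive epochs fail to improve on the
    best loss seen so far by more than `min_delta`.
    """
    best_loss = float("inf")
    best_epoch = -1
    epochs_without_improvement = 0
    for epoch, loss in enumerate(val_losses):
        if loss < best_loss - min_delta:
            best_loss, best_epoch = loss, epoch
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                return epoch, best_epoch
    # never triggered: training ran through every recorded epoch
    return len(val_losses) - 1, best_epoch

stop, best = early_stopping_epoch([1.0, 0.8, 0.7, 0.75, 0.72, 0.71, 0.9], patience=3)
# stop == 5, best == 2: the weights from epoch 2 would be the ones to keep
```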
Tradeoff between depth (many layers) and width (many neurons per layer).
Formal framework for sequential decision-making under uncertainty.
Set of all actions available to the agent.
Strategy mapping states to actions.
Number of steps considered in planning.
Expected cumulative reward from a state or state-action pair.
Visualization of optimization landscape.
Lowest possible value of the loss function over the entire parameter space.
Choosing the step size taken along the negative-gradient (descent) direction.
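Backtracking with a sufficient-decrease (Armijo) test is one standard way to choose such a step size. The sketch below is an assumption-laden illustration: the function name, the plain-list vector representation, and the default constants are hypothetical.

```python
def backtracking_step(f, grad, x, alpha0=1.0, shrink=0.5, c=1e-4, max_halvings=50):
    """Shrink the step size until the Armijo sufficient-decrease condition holds.

    Returns (alpha, new_x); falls back to (0.0, x) if no step is accepted.
    """
    g = grad(x)
    g_norm_sq = sum(gi * gi for gi in g)
    fx = f(x)
    alpha = alpha0
    for _ in range(max_halvings):
        candidate = [xi - alpha * gi for xi, gi in zip(x, g)]  # step along -gradient
        if f(candidate) <= fx - c * alpha * g_norm_sq:
            return alpha, candidate
        alpha *= shrink
    return 0.0, list(x)

# Example on f(x) = x^2 starting at x = 1: the full step alpha = 1 overshoots
# to x = -1 (no decrease), so the search halves it once.
alpha, new_x = backtracking_step(lambda v: v[0] ** 2, lambda v: [2 * v[0]], [1.0])
```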