Results for "recursive value"

48 results

Bellman Equation Intermediate

Fundamental recursive relationship defining optimal value functions.

AI Economics & Strategy
Value Function Intermediate

Expected cumulative reward from a state or state-action pair.

AI Economics & Strategy
Feedback Loop Collapse Intermediate

A model trained on its own outputs degrades in quality.

Model Failure Modes
Key-Value Cache Intermediate

Stores past attention states to speed up autoregressive decoding.

AI Economics & Strategy
Actor-Critic Intermediate

Combines value estimation (critic) with policy learning (actor).

AI Economics & Strategy
Value at Risk Intermediate

Loss threshold that a portfolio is not expected to exceed over a given horizon at a stated confidence level.

AI Economics & Strategy
Q-Function Intermediate

Expected return from taking a given action in a state and then following a policy.

AI Economics & Strategy
Law of Large Numbers Advanced

The sample mean converges to the expected value as the sample size grows.

Probability & Statistics
Policy Gradient Intermediate

Optimizing policies directly via gradient ascent on expected reward.

AI Economics & Strategy
Monte Carlo Estimation Advanced

Approximating expectations via random sampling.

Probability & Statistics
Value Misalignment Advanced

Model optimizes objectives misaligned with human values.

AI Safety & Alignment
Policy Search Advanced

Directly optimizing control policies.

Reinforcement Learning
Value Learning Intermediate

Inferring and aligning with human preferences.

Governance & Ethics
Mean Squared Error Intermediate

Average of squared residuals; common regression objective.

Optimization
Gradient Descent Intermediate

Iterative method that updates parameters in the direction of negative gradient to minimize loss.

Optimization
SHAP Intermediate

Feature attribution method grounded in cooperative game theory for explaining predictions in tabular settings.

Foundations & Theory
Gradient Clipping Intermediate

Limiting gradient magnitude to prevent exploding gradients.

AI Economics & Strategy
Energy-Based Model Intermediate

Models that define an energy landscape rather than explicit probabilities.

Model Architectures
Cross-Attention Intermediate

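Attention in which queries come from one sequence and keys and values from another, such as a decoder attending to encoder outputs or text attending to image features.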

Computer Vision
Singular Value Decomposition Advanced

Decomposes a matrix into orthogonal components; used in embeddings and compression.

Mathematics
Expectation Advanced

Average value under a distribution.

Probability & Statistics
Cooperative Game Advanced

Agents optimize collective outcomes.

Agents & Autonomy
Reinforcement Learning Intermediate

A learning paradigm where an agent interacts with an environment and learns to choose actions to maximize cumulative reward.

Reinforcement Learning
Precision Intermediate

Of predicted positives, the fraction that are truly positive; sensitive to false positives.

Foundations & Theory
AUC Intermediate

Scalar summary of ROC; measures ranking ability, not calibration.

Foundations & Theory
Flat Minimum Intermediate

A wide, low-curvature basin in the loss landscape, often correlated with better generalization.

AI Economics & Strategy
Bias Term Intermediate

Learnable constant added to a weighted sum of inputs, shifting a model's output independently of the input values.

AI Economics & Strategy
Warmup Intermediate

Gradually increasing learning rate at training start to avoid divergence.

AI Economics & Strategy
Maximum Likelihood Estimation Intermediate

Estimating parameters by maximizing likelihood of observed data.

AI Economics & Strategy
Policy Intermediate

Strategy mapping states to actions.

AI Economics & Strategy
