Results for "expected return"

AdvertisementAd space — search-top

51 results

Policy Gradient Intermediate

Optimizing policies directly via gradient ascent on expected reward.

AI Economics & Strategy
Alpha Intermediate

Returns above benchmark.

AI Economics & Strategy
Value Function Intermediate

Expected cumulative reward from a state or state-action pair.

AI Economics & Strategy
Q-Function Intermediate

Expected return of taking action in a state.

AI Economics & Strategy
Policy Intermediate

Strategy mapping states to actions.

AI Economics & Strategy
Reinforcement Learning Intermediate

A learning paradigm where an agent interacts with an environment and learns to choose actions to maximize cumulative reward.

Reinforcement Learning
Markov Decision Process Intermediate

Formal framework for sequential decision-making under uncertainty.

AI Economics & Strategy
Exploration-Exploitation Tradeoff Intermediate

Balancing learning new behaviors vs exploiting known rewards.

AI Economics & Strategy
Policy Search Advanced

Directly optimizing control policies.

Reinforcement Learning
On-Policy Learning Intermediate

Learning only from current policy’s data.

AI Economics & Strategy
Law of Large Numbers Advanced

Sample mean converges to expected value.

Probability & Statistics
AI Center of Excellence Intermediate

Centralized AI expertise group.

Governance & Ethics
Cost Attribution Intermediate

Assigning AI costs to business units.

AI Economics & Strategy
Stability Intermediate

System returns to equilibrium after disturbance.

Foundations & Theory
Lyapunov Stability Intermediate

Stability proven via monotonic decrease of Lyapunov function.

Foundations & Theory
Risk Model Intermediate

Quantifying financial risk.

AI Economics & Strategy
Variance Term Intermediate

Error due to sensitivity to fluctuations in the training dataset.

AI Economics & Strategy
Average Treatment Effect Advanced

Expected causal effect of a treatment.

Causal AI & Interpretability
Monte Carlo Estimation Advanced

Approximating expectations via random sampling.

Probability & Statistics
Value at Risk Intermediate

Maximum expected loss under normal conditions.

AI Economics & Strategy
Meta-Learning Intermediate

Methods that learn training procedures or initializations so models can adapt quickly to new tasks with little data.

Machine Learning
Objective Function Intermediate

A scalar measure optimized during training, typically expected loss over data, sometimes with regularization terms.

Optimization
Generalization Intermediate

How well a model performs on new data drawn from the same (or similar) distribution as training.

Foundations & Theory
Dropout Intermediate

Randomly zeroing activations during training to reduce co-adaptation and overfitting.

Foundations & Theory
System Prompt Intermediate

A high-priority instruction layer setting overarching behavior constraints for a chat model.

Reinforcement Learning
RLHF Intermediate

Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.

Optimization
SHAP Intermediate

Feature attribution method grounded in cooperative game theory for explaining predictions in tabular settings.

Foundations & Theory
Function Calling Intermediate

Constraining model outputs into a schema used to call external APIs/tools safely and deterministically.

Foundations & Theory
Planner-Executor Intermediate

Separates planning from execution in agent architectures.

AI Economics & Strategy
Bias Term Intermediate

Systematic error introduced by simplifying assumptions in a learning algorithm.

AI Economics & Strategy

Welcome to AI Glossary

The free, self-building AI dictionary. Help us keep it free—click an ad once in a while!

Search

Type any question or keyword into the search bar at the top.

Browse

Tap a letter in the A–Z bar to browse terms alphabetically, or filter by domain, industry, or difficulty level.

3D WordGraph

Fly around the interactive 3D graph to explore how AI concepts connect. Click any word to read its full definition.