Results for "policies"

24 results

Policy Gradient Intermediate

Optimizing policies directly via gradient ascent on expected reward.

AI Economics & Strategy
DPO Intermediate

Direct Preference Optimization: a preference-based training method that optimizes policies directly from pairwise comparisons, without an explicit RL loop.

Optimization
Model Governance Intermediate

Policies and practices for approving, monitoring, auditing, and documenting models in production.

Governance & Ethics
Policy Intermediate

Strategy mapping states to actions.

AI Economics & Strategy
Optimal Control Intermediate

Finding control policies minimizing cumulative cost.

Foundations & Theory
Causal Inference Intermediate

Framework for reasoning about cause-effect relationships beyond correlation, often using structural assumptions and experiments.

Foundations & Theory
Data Governance Intermediate

Processes and controls for data quality, access, lineage, retention, and compliance across the AI lifecycle.

Foundations & Theory
KL Divergence Intermediate

Measures how one probability distribution diverges from another.

AI Economics & Strategy
Audit Intermediate

Systematic review of model/data processes to ensure performance, fairness, security, and policy compliance.

Governance & Ethics
Action Space Intermediate

Set of all actions available to the agent.

AI Economics & Strategy
Markov Decision Process Intermediate

Formal framework for sequential decision-making under uncertainty.

AI Economics & Strategy
Bellman Equation Intermediate

Fundamental recursive relationship defining optimal value functions.

AI Economics & Strategy
Value Function Intermediate

Expected cumulative reward from a state or state-action pair.

AI Economics & Strategy
Exploration-Exploitation Tradeoff Intermediate

Balancing trying new behaviors against exploiting known rewards.

AI Economics & Strategy
Q-Function Intermediate

Expected return from taking a given action in a given state.

AI Economics & Strategy
Caching Intermediate

Storing results to reduce compute.

AI Economics & Strategy
Off-Policy Learning Intermediate

Learning from data generated by a different policy.

AI Economics & Strategy
Controller Intermediate

Algorithm computing control actions.

Foundations & Theory
Deceptive Alignment Advanced

A model behaves well during training but not in deployment.

AI Safety & Alignment
Model-Free RL Advanced

RL without an explicit dynamics model.

Reinforcement Learning
Model-Based RL Advanced

RL using learned or known environment models.

Reinforcement Learning
Policy Search Advanced

Directly optimizing control policies.

Reinforcement Learning
Imitation Learning Advanced

Learning policies from expert demonstrations.

Reinforcement Learning
Compute Governance Intermediate

Regulating access to large-scale compute.

Governance & Ethics