Results for "policies"
Optimizing policies directly via gradient ascent on expected reward.
A preference-based training method optimizing policies directly from pairwise comparisons without explicit RL loops.
Policies and practices for approving, monitoring, auditing, and documenting models in production.
Strategy mapping states to actions.
Finding control policies minimizing cumulative cost.
Framework for reasoning about cause-effect relationships beyond correlation, often using structural assumptions and experiments.
Processes and controls for data quality, access, lineage, retention, and compliance across the AI lifecycle.
Measures how one probability distribution diverges from another.
Systematic review of model/data processes to ensure performance, fairness, security, and policy compliance.
Set of all actions available to the agent.
Formal framework for sequential decision-making under uncertainty.
Fundamental recursive relationship defining optimal value functions.
Expected cumulative reward from a state or state-action pair.
Balancing exploration of new behaviors against exploitation of known rewards.
Expected return of taking a given action in a given state, then following the policy.
Storing computed results for reuse to avoid redundant computation.
Learning from data generated by a different policy.
Algorithm computing control actions from system state or measurements.
Model performs well during training but fails to generalize at deployment.
RL without an explicit dynamics model.
RL using learned or known environment models.
Directly optimizing control policies.
Learning policies from expert demonstrations.
Regulating access to large-scale compute.
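Several of the entries above (Markov decision process, Bellman equation, value function, Q-function) fit together in one algorithm. The sketch below is a minimal value iteration on a toy MDP; the two-state transition matrix, rewards, and discount factor are invented for illustration, not taken from any entry:

```python
import numpy as np

# Minimal value-iteration sketch illustrating the Bellman optimality recursion:
#   V(s) = max_a [ R(s, a) + gamma * sum_s' P(s'|s, a) V(s') ]
# The MDP below (2 states, 2 actions) is a made-up toy example.

gamma = 0.9  # discount factor

# P[a, s, s'] = probability of moving from s to s' under action a.
P = np.array([
    [[0.8, 0.2], [0.1, 0.9]],   # action 0
    [[0.5, 0.5], [0.6, 0.4]],   # action 1
])
# R[s, a] = immediate reward for taking action a in state s.
R = np.array([
    [1.0, 0.0],
    [0.0, 2.0],
])

V = np.zeros(2)
for _ in range(500):
    # Q[s, a] = immediate reward plus discounted expected next-state value.
    Q = R + gamma * np.einsum("ast,t->sa", P, V)
    V_new = Q.max(axis=1)          # Bellman optimality backup
    if np.max(np.abs(V_new - V)) < 1e-8:
        break                      # converged to a fixed point
    V = V_new

policy = Q.argmax(axis=1)          # greedy policy: best action per state
print("V* =", V, " policy =", policy)
```

The greedy policy extracted at the end is exactly the "strategy mapping states to actions" from the glossary, and the fixed point of the loop satisfies the recursive Bellman relationship.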