Results for "policy"

Policy

Intermediate

Strategy mapping states to actions.

A policy is like a game plan for an AI, telling it what to do in different situations. Imagine you're playing a sport: your coach gives you a strategy for how to play based on the situation on the field. In the same way, a policy helps an AI decide which action to take when it finds itself in a p...
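As a rough illustration of "strategy mapping states to actions", the sketch below represents a policy as a plain lookup table, in both a deterministic and a stochastic form. The state names, action names, and probabilities are hypothetical, chosen only to echo the sports analogy; they do not come from the glossary.

```python
import random

# Deterministic policy: each state maps to exactly one action.
# State and action names are hypothetical, for illustration only.
deterministic_policy = {
    "has_ball_near_goal": "shoot",
    "has_ball_midfield": "pass",
    "opponent_has_ball": "defend",
}

# Stochastic policy: each state maps to a probability distribution over actions.
# The probabilities are made up and sum to 1 for every state.
stochastic_policy = {
    "has_ball_near_goal": {"shoot": 0.8, "pass": 0.2},
    "has_ball_midfield": {"pass": 0.7, "dribble": 0.3},
    "opponent_has_ball": {"defend": 1.0},
}


def act(policy, state):
    """Return the single action a deterministic policy prescribes for `state`."""
    return policy[state]


def sample_action(policy, state, rng):
    """Draw one action from a stochastic policy's distribution for `state`."""
    actions, weights = zip(*policy[state].items())
    return rng.choices(actions, weights=weights, k=1)[0]


rng = random.Random(0)  # seeded so repeated runs draw the same samples
chosen = act(deterministic_policy, "has_ball_midfield")
sampled = sample_action(stochastic_policy, "has_ball_near_goal", rng)
```

In a learned policy the table is usually replaced by a parameterized function such as a neural network, but the interface stays the same: a state goes in, an action (or a distribution over actions) comes out.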


31 results

Policy Search Advanced

Directly optimizing control policies.

Reinforcement Learning
On-Policy Learning Intermediate

Learning only from data generated by the current policy.

AI Economics & Strategy
Off-Policy Learning Intermediate

Learning from data generated by a different policy.

AI Economics & Strategy
Policy Gradient Intermediate

Optimizing policies directly via gradient ascent on expected reward.

AI Economics & Strategy
Policy Intermediate

Strategy mapping states to actions.

AI Economics & Strategy
Actor-Critic Intermediate

Combines value estimation (critic) with policy learning (actor).

AI Economics & Strategy
Agent Loop Intermediate

Continuous cycle of observation, reasoning, action, and feedback.

AI Economics & Strategy
Reinforcement Learning Intermediate

A learning paradigm where an agent interacts with an environment and learns to choose actions to maximize cumulative reward.

Reinforcement Learning
RLHF Intermediate

Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.

Optimization
Red Teaming Intermediate

Stress-testing models for failures, vulnerabilities, policy violations, and harmful behaviors before release.

Security & Privacy
Guardrails Intermediate

Rules and controls around generation (filters, validators, structured outputs) to reduce unsafe or invalid behavior.

Reinforcement Learning
Planner-Executor Intermediate

Separates planning from execution in agent architectures.

AI Economics & Strategy
Value Function Intermediate

Expected cumulative reward from a state or state-action pair.

AI Economics & Strategy
Explainability Requirement Intermediate

Legal or policy requirement to explain AI decisions.

AI Economics & Strategy
Controller Intermediate

Algorithm computing control actions.

Foundations & Theory
Model-Free RL Advanced

RL without an explicit dynamics model.

Reinforcement Learning
Behavior Cloning Advanced

Learning a state-to-action mapping directly from demonstrations.

Reinforcement Learning
System Prompt Intermediate

A high-priority instruction layer setting overarching behavior constraints for a chat model.

Reinforcement Learning
Reward Model Intermediate

Model trained to predict human preferences (or utility) for candidate outputs; used in RLHF-style pipelines.

Foundations & Theory
State Space Intermediate

All possible configurations an agent may encounter.

AI Economics & Strategy
Audit Intermediate

Systematic review of model/data processes to ensure performance, fairness, security, and policy compliance.

Governance & Ethics
Bellman Equation Intermediate

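Fundamental recursive relationship expressing a state's value in terms of immediate reward and successor-state values; its optimality form defines optimal value functions.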

AI Economics & Strategy
Markov Decision Process Intermediate

Formal framework for sequential decision-making under uncertainty.

AI Economics & Strategy
Structural Causal Model Advanced

Formal model linking causal mechanisms and variables.

Causal AI & Interpretability
Q-Function Intermediate

Expected return from taking a given action in a state and following the policy thereafter.

AI Economics & Strategy
Average Treatment Effect Advanced

Expected causal effect of a treatment.

Causal AI & Interpretability
Inner Alignment Advanced

Ensuring learned behavior matches intended objective.

AI Safety & Alignment
Domain Randomization Advanced

Randomizing simulation parameters to improve real-world transfer.

Simulation & Sim-to-Real
Model-Based RL Advanced

RL using learned or known environment models.

Reinforcement Learning
Imitation Learning Advanced

Learning policies from expert demonstrations.

Reinforcement Learning
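
Several of the entries above (Policy, Value Function, Q-Function, Bellman Equation, Markov Decision Process) fit together in a small set of equations. The block below is a sketch in common textbook notation, which is assumed here rather than taken from the glossary: \pi(a \mid s) is the policy, P(s' \mid s, a) the transition probability, R(s, a, s') the reward, and \gamma \in [0, 1) the discount factor.

```latex
\begin{align}
V^{\pi}(s) &= \sum_{a} \pi(a \mid s)\, Q^{\pi}(s, a) \\
Q^{\pi}(s, a) &= \sum_{s'} P(s' \mid s, a)\,\bigl[ R(s, a, s') + \gamma\, V^{\pi}(s') \bigr] \\
V^{*}(s) &= \max_{a} \sum_{s'} P(s' \mid s, a)\,\bigl[ R(s, a, s') + \gamma\, V^{*}(s') \bigr]
\end{align}
```

The first two lines define the value function and Q-function of a fixed policy in terms of each other; the third is the Bellman optimality equation, where the maximum over actions replaces the average over the policy.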
