Results for "policy"
Policy
Strategy mapping states to actions.
A policy is like a game plan for an AI, telling it what to do in different situations. Imagine you're playing a sport: your coach gives you a strategy for how to play based on the situation on the field. In the same way, a policy helps an AI decide which action to take when it finds itself in a particular situation.
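The idea above can be sketched as a simple lookup from states to actions; the state and action names here are purely illustrative, not from any particular library:

```python
# A deterministic policy: each state the agent can encounter maps to one action.
policy = {
    "low_battery": "recharge",
    "obstacle_ahead": "turn_left",
    "goal_visible": "move_forward",
}

def act(state):
    """Return the action the policy prescribes for the given state."""
    return policy[state]

print(act("obstacle_ahead"))  # turn_left
```

A learned policy works the same way in principle, except the mapping is a trained function (often a neural network) rather than a hand-written table.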
Directly optimizing control policies.
Learning only from current policy’s data.
Learning from data generated by a different policy.
Optimizing policies directly via gradient ascent on expected reward.
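In symbols, a standard (REINFORCE-style) form of this gradient ascent on expected reward, for a policy with parameters theta, is:

```latex
\nabla_\theta J(\theta)
  = \mathbb{E}_{\tau \sim \pi_\theta}\!\left[
      \sum_t \nabla_\theta \log \pi_\theta(a_t \mid s_t)\, G_t
    \right]
```

where $G_t$ is the return following time step $t$; ascending this gradient raises the probability of actions that led to high reward.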
Combines value estimation (critic) with policy learning (actor).
Continuous cycle of observation, reasoning, action, and feedback.
A learning paradigm where an agent interacts with an environment and learns to choose actions to maximize cumulative reward.
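A minimal sketch of that agent-environment interaction loop, with a toy hand-written environment (the target value, reward shape, and names are illustrative assumptions, not a real RL library API):

```python
import random

TARGET = 5  # the state the agent is rewarded for reaching

def step(state, action):
    """Toy environment: action (+1 or -1) shifts the state; reward is
    higher the closer the new state is to TARGET."""
    new_state = state + action
    reward = -abs(TARGET - new_state)
    return new_state, reward

# Agent-environment loop: observe state, pick action, receive reward.
state, total_reward = 0, 0
for _ in range(10):
    action = random.choice([-1, 1])   # placeholder for a learned policy
    state, reward = step(state, action)
    total_reward += reward
```

A real RL agent would replace the random choice with a policy that is updated from the observed rewards so as to maximize the cumulative return.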
Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.
Stress-testing models for failures, vulnerabilities, policy violations, and harmful behaviors before release.
Rules and controls around generation (filters, validators, structured outputs) to reduce unsafe or invalid behavior.
Separates planning from execution in agent architectures.
Expected cumulative reward from a state or state-action pair.
Legal or policy requirement to explain AI decisions.
Algorithm computing control actions.
RL without explicit dynamics model.
Learning action mapping directly from demonstrations.
A high-priority instruction layer setting overarching behavior constraints for a chat model.
Model trained to predict human preferences (or utility) for candidate outputs; used in RLHF-style pipelines.
All possible configurations an agent may encounter.
Systematic review of model/data processes to ensure performance, fairness, security, and policy compliance.
Fundamental recursive relationship defining optimal value functions.
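For a discrete Markov decision process, this recursion is typically written (for the optimal state-value function) as:

```latex
V^*(s) = \max_{a} \sum_{s'} P(s' \mid s, a)\,\bigl[\, R(s, a, s') + \gamma\, V^*(s') \,\bigr]
```

with discount factor $\gamma \in [0,1)$: the optimal value of a state is the best achievable immediate reward plus the discounted value of the resulting successor state.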
Formal framework for sequential decision-making under uncertainty.
Formal model linking causal mechanisms and variables.
Expected return of taking action in a state.
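Written out, the expected return of taking action $a$ in state $s$ and thereafter following policy $\pi$ is:

```latex
Q^{\pi}(s, a) = \mathbb{E}_{\pi}\!\left[\, \sum_{t=0}^{\infty} \gamma^{t}\, r_{t+1} \;\middle|\; s_0 = s,\; a_0 = a \,\right]
```

where $r_{t+1}$ is the reward received after step $t$ and $\gamma$ is the discount factor.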
Expected causal effect of a treatment.
Ensuring learned behavior matches intended objective.
Randomizing simulation parameters to improve real-world transfer.
RL using learned or known environment models.
Learning policies from expert demonstrations.