Results for "policy consistency"

13 results

Data Labeling Intermediate

Human or automated process of assigning target labels to data; label quality, consistency, and guidelines matter heavily.

Foundations & Theory
Inter-Annotator Agreement Intermediate

Measure of consistency across labelers; low agreement indicates ambiguous tasks or poor guidelines.

Foundations & Theory
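
As an illustration of how agreement between two labelers can be measured, here is a minimal sketch of Cohen's kappa, one common agreement statistic (the glossary entry does not name a specific metric, so this choice is an assumption). The function name `cohens_kappa` and the sample labels are hypothetical.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Chance-corrected agreement between two annotators (Cohen's kappa)."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label lists must be non-empty and of equal length")
    n = len(labels_a)
    # Observed agreement: fraction of items given the same label by both.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Expected agreement if each annotator labeled independently
    # according to their own label frequencies.
    counts_a = Counter(labels_a)
    counts_b = Counter(labels_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    if expected == 1.0:
        # Both annotators used one identical label throughout; kappa is
        # undefined here, and returning 1.0 is a convention (assumption).
        return 1.0
    return (observed - expected) / (1 - expected)


# Hypothetical labels from two annotators on four items.
annotator_1 = ["pos", "pos", "neg", "neg"]
annotator_2 = ["pos", "neg", "neg", "neg"]
print(cohens_kappa(annotator_1, annotator_2))
```

Values near 1 indicate strong agreement; values near 0 indicate agreement no better than chance, which often points to ambiguous tasks or unclear guidelines.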
Off-Policy Learning Intermediate

Learning from data generated by a policy other than the one being learned.

AI Economics & Strategy
On-Policy Learning Intermediate

Learning only from data generated by the current policy.

AI Economics & Strategy
RLHF Intermediate

Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.

Optimization
Audit Intermediate

Systematic review of model/data processes to ensure performance, fairness, security, and policy compliance.

Governance & Ethics
Red Teaming Intermediate

Stress-testing models for failures, vulnerabilities, policy violations, and harmful behaviors before release.

Security & Privacy
Actor-Critic Intermediate

Combines value estimation (critic) with policy learning (actor).

AI Economics & Strategy
Explainability Requirement Intermediate

Legal or policy requirement to explain AI decisions.

AI Economics & Strategy
Self-Consistency Intro

Sampling multiple outputs and selecting the consensus answer.

Prompting & Instructions
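
A minimal sketch of self-consistency as a majority vote over sampled answers. It assumes a caller-supplied `sample_fn` (hypothetical) that returns one final answer string per call, for example by querying a model at non-zero temperature; no real model API is implied.

```python
from collections import Counter


def self_consistent_answer(sample_fn, prompt, n_samples=5):
    """Sample several answers and return the most common one with its vote share."""
    answers = [sample_fn(prompt) for _ in range(n_samples)]
    # most_common(1) returns [(answer, count)] for the top-voted answer.
    answer, votes = Counter(answers).most_common(1)[0]
    return answer, votes / n_samples


# Stand-in sampler that replays canned outputs (illustration only).
canned = iter(["42", "41", "42", "42", "40"])
result = self_consistent_answer(lambda prompt: next(canned), "What is 6 * 7?")
print(result)
```

The vote share gives a rough confidence signal: a low share suggests the samples disagree and the consensus answer deserves less trust.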
Policy Intermediate

Strategy mapping states to actions.

AI Economics & Strategy
Policy Gradient Intermediate

Optimizing policies directly via gradient ascent on expected reward.

AI Economics & Strategy
Policy Search Advanced

Optimizing control policies directly in policy space, rather than deriving them from a value function.

Reinforcement Learning