Results for "direct preference optimization"

123 results

Covariance Advanced

Measures joint variability between variables.

Probability & Statistics
Reward Hacking Advanced

Maximizing reward without fulfilling real goal.

AI Safety & Alignment
Mesa-Optimizer Advanced

Learned subsystem that optimizes its own objective.

AI Safety & Alignment
Scalable Oversight Advanced

Using limited human feedback to guide large models.

AI Safety & Alignment
Constraint Prompting Intro

Explicit output constraints (format, tone).

Prompting & Instructions
Reflection Prompting Intro

Asking model to review and improve output.

Prompting & Instructions
Decomposition Prompt Intro

Breaking tasks into sub-steps.

Prompting & Instructions
Model Orchestration Intermediate

Coordinating models, tools, and logic.

AI Economics & Strategy
Explainability Mandate Intermediate

Requirement to provide explanations.

Governance & Ethics
Token Budgeting Intermediate

Limiting inference usage.

AI Economics & Strategy
Throughput Ceiling Intermediate

Maximum system processing rate.

AI Economics & Strategy
Control Theory Intermediate

Mathematical framework for controlling dynamic systems.

Foundations & Theory
Model Predictive Control Intermediate

Optimizes future actions using a model of dynamics.

Foundations & Theory
Robust Control Intermediate

Control that remains stable under model uncertainty.

Foundations & Theory
Inverse Kinematics Advanced

Computing joint angles for desired end-effector pose.

Dynamics & Physics
Digital Twin Advanced

High-fidelity virtual model of a physical system.

Simulation & Sim-to-Real
Domain Randomization Advanced

Randomizing simulation parameters to improve real-world transfer.

Simulation & Sim-to-Real
RRT Advanced

Sampling-based motion planner.

Motion Planning & Navigation
Latent Dynamics Frontier

Modeling environment evolution in latent space.

World Models & Cognition
Physical Safety Frontier

Ensuring robots do not harm humans.

World Models & Cognition
Active Inference Frontier

Acting to minimize surprise or free energy.

World Models & Cognition
Lifelong Learning Advanced

Learning without catastrophic forgetting.

Agents & Autonomy
AI Hallucination Intermediate

Fabrication of cases or statutes by LLMs.

AI in Law
Algorithmic Trading Intermediate

AI-driven buying/selling of financial assets.

AI Economics & Strategy
Scientific ML Advanced

AI applied to scientific problems.

AI in Science
Symbolic Regression Advanced

Finding mathematical equations from data.

AI in Science
Market Design Advanced

Designing efficient marketplaces.

Agents & Autonomy
Swarm Dynamics Advanced

Collective behavior without central control.

Dynamics & Physics
Capability Overhang Advanced

Latent compute or algorithmic advances that enable rapid capability jumps.

AI Safety & Alignment
Alignment Tax Advanced

Tradeoff between safety and performance.

AI Safety & Alignment
