Search: MDP — AI Glossary

Markov Decision Process Intermediate

Formal framework for sequential decision-making under uncertainty.

AI Economics & Strategy

RLHF Intermediate

Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.

Optimization

Action Space Intermediate

Set of all actions available to the agent.

AI Economics & Strategy

Shared Autonomy Frontier

Control shared between human and agent.

World Models & Cognition

Active Experimentation Advanced

AI selecting next experiments.

AI in Science

Results for "MDP"