RLHF

Intermediate

Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.


Why It Matters

RLHF is crucial for developing AI systems that align closely with human preferences and values. By incorporating human feedback into the training process, AI can produce more relevant and acceptable outputs, which is essential in applications ranging from customer service to content generation, ultimately enhancing user trust and satisfaction.

Reinforcement Learning from Human Feedback (RLHF) is a machine learning paradigm in which a model is trained to optimize its outputs using preference data derived from human evaluations. In this framework, a reward model is first trained to predict human preferences among candidate outputs; this reward model then guides the training of the primary model (the policy) through reinforcement learning. The mathematical foundation of RLHF formulates the learning process as a Markov Decision Process (MDP), in which the agent (the model) learns a policy that maximizes expected cumulative reward derived from the feedback. This approach is particularly effective in aligning AI behavior with human values and preferences, addressing challenges of model alignment and safety in AI systems.
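As a rough illustration of the two training stages, the sketch below shows a Bradley-Terry-style preference loss for the reward model and a KL-regularized reward signal that could feed a policy-gradient update. It assumes a PyTorch-style API; the function names, tensor shapes, and the kl_coef value are illustrative assumptions, not any particular library's implementation.

import torch.nn.functional as F

def reward_model_loss(reward_model, chosen_ids, rejected_ids):
    """Bradley-Terry preference loss: the reward model should score the
    human-preferred (chosen) response above the rejected one."""
    r_chosen = reward_model(chosen_ids)      # scalar reward per example, shape (batch,)
    r_rejected = reward_model(rejected_ids)  # scalar reward per example, shape (batch,)
    return -F.logsigmoid(r_chosen - r_rejected).mean()

def shaped_policy_reward(policy_logprobs, ref_logprobs, rewards, kl_coef=0.1):
    """KL-regularized reward used during the RL stage: maximize the learned
    reward while penalizing drift from a frozen reference model."""
    kl_estimate = policy_logprobs - ref_logprobs   # per-token KL estimate
    shaped = rewards - kl_coef * kl_estimate
    # In practice this shaped reward is fed to a policy-gradient method such
    # as PPO; here we simply return its mean as the quantity to maximize.
    return shaped.mean()

In typical setups the first function is minimized over a dataset of human-ranked response pairs, and the second defines the objective the policy is trained to maximize against the frozen reference model.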

