Results for "preference scoring"
Brier Score
Intermediate
A proper scoring rule measuring squared error of predicted probabilities for binary outcomes.
RLHF
Intermediate
Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.
DPO
Intermediate
A preference-based training method optimizing policies directly from pairwise comparisons without explicit RL loops.
Credit Scoring
Intermediate
Predicting borrower default risk.