Results for "preference scoring"
Predicting borrower default risk.
Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.
A preference-based training method that optimizes a policy directly from pairwise comparisons, without an explicit RL loop.
A proper scoring rule measuring the mean squared error between predicted probabilities and binary outcomes.
Inferring a reward function from observed behavior.
Inferring and aligning with human preferences.
Legal or policy requirement to explain AI decisions.
Central log of AI-related risks.
Periodically running predictions over large datasets.
Credit models with interpretable logic.
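
Two of the entries above describe quantities that can be computed directly, so a minimal Python sketch may help. The first function implements the squared-error scoring rule for binary outcomes (commonly known as the Brier score). The second shows one common way a reward model can be fit to pairwise preference data: the negative log-likelihood of a Bradley-Terry model. The results do not name either formula, so the Bradley-Terry choice is an assumption, and the function names `brier_score` and `pairwise_preference_loss` are hypothetical, chosen only for illustration.

```python
import math


def brier_score(probs, outcomes):
    """Mean squared error between predicted probabilities and 0/1 outcomes.

    Lower is better; 0.0 means every prediction was exactly right.
    """
    if len(probs) != len(outcomes) or not probs:
        raise ValueError("probs and outcomes must be non-empty and equal length")
    return sum((p - y) ** 2 for p, y in zip(probs, outcomes)) / len(probs)


def pairwise_preference_loss(score_chosen, score_rejected):
    """Negative log-likelihood that the chosen item beats the rejected one.

    Assumes a Bradley-Terry model (an assumption, not stated in the source):
        P(chosen > rejected) = sigmoid(score_chosen - score_rejected)
    so the loss is -log(sigmoid(margin)) = log(1 + exp(-margin)).
    """
    margin = score_chosen - score_rejected
    # Branch on the sign of the margin so exp() never overflows.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


if __name__ == "__main__":
    # Confident, correct predictions score near 0; hedged ones score higher.
    print(brier_score([0.9, 0.1, 0.8], [1, 0, 1]))
    # The loss shrinks as the preferred item's score pulls ahead.
    print(pairwise_preference_loss(2.0, 0.0))
    print(pairwise_preference_loss(0.0, 2.0))
```

A constant prediction of 0.5 always scores 0.25 under the first function, which gives a rough baseline for judging whether a probabilistic model is informative. In the second function, equal scores give a loss of log 2, the value for a model that cannot tell the two items apart.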