Search: delayed feedback

RLHF Intermediate

Reinforcement learning from human feedback: uses human preference data to train a reward model, which is then used to optimize the policy.

Optimization

Agent Loop Intermediate

Continuous cycle of observation, reasoning, action, and feedback.

AI Economics & Strategy

Scalable Oversight Advanced

Using limited human feedback to guide large models.

AI Safety & Alignment

Control Loop Advanced

Continuous loop that adjusts actions based on state feedback.

Robotics & Embodied AI

Closed-Loop Control Advanced

Control using real-time sensor feedback.

Robotics & Embodied AI

Open-Loop Control Advanced

Control without feedback after execution begins.

Robotics & Embodied AI

Feedback Loop Intermediate

Using production outcomes to improve models.

MLOps & Infrastructure

Feedback Loop Collapse Intermediate

A model trained on its own outputs progressively degrades in quality.

Model Failure Modes

Feedback Intermediate

Using a system's output to adjust its future inputs.

Foundations & Theory

Feedback Amplification Advanced

AI systems reinforcing existing market trends.

Agents & Autonomy
