Results for "delayed feedback"
Reward only given upon task completion.
Continuous cycle of observation, reasoning, action, and feedback.
Using production outcomes to improve models.
Continuous loop adjusting actions based on state feedback.
Control using real-time sensor feedback.
Using output to adjust future inputs.
AI reinforcing market trends.
Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.
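A minimal sketch of the pairwise objective commonly used to train such a reward model (a Bradley-Terry loss over preference pairs; the function name and scores are illustrative, not from the source):

```python
import math

def pairwise_reward_loss(score_chosen, score_rejected):
    """Bradley-Terry pairwise loss: drive the preferred output's
    score above the rejected output's score."""
    margin = score_chosen - score_rejected
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

# Loss shrinks as the reward model ranks the preferred output higher.
easy = pairwise_reward_loss(2.0, 0.0)   # preferred output already scores higher
hard = pairwise_reward_loss(0.0, 2.0)   # ranking is inverted, loss is large
```

The policy-optimization stage that follows (e.g. with a policy-gradient method against the learned scores) is omitted here.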
Using limited human feedback to guide large models.
Training a model on its own outputs degrades output quality.
Models evaluating and improving their own outputs.
Control without feedback after execution begins.
Equations governing how system states change over time.
Human controlling robot remotely.
Closed loop linking sensing and acting.
A learning paradigm where an agent interacts with an environment and learns to choose actions to maximize cumulative reward.
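As a concrete sketch of that interaction loop, assuming a toy corridor task where reward arrives only at the goal (tabular Q-learning; all names and parameters are illustrative):

```python
import random

def q_learning(n_states=5, episodes=500, alpha=0.5, gamma=0.9, eps=0.2, seed=0):
    """Tabular Q-learning on a corridor: reward is delayed until the goal."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # actions: 0 = left, 1 = right
    for _ in range(episodes):
        s = 0
        while s < n_states - 1:
            # epsilon-greedy action selection
            a = rng.randrange(2) if rng.random() < eps else max((0, 1), key=lambda x: q[s][x])
            s_next = max(0, s - 1) if a == 0 else s + 1
            r = 1.0 if s_next == n_states - 1 else 0.0  # reward only at the goal
            bootstrap = 0.0 if s_next == n_states - 1 else max(q[s_next])
            q[s][a] += alpha * (r + gamma * bootstrap - q[s][a])
            s = s_next
    return q

q = q_learning()
# Greedy policy should point right (toward the delayed reward) in every state.
policy = [max((0, 1), key=lambda x: q[s][x]) for s in range(4)]
```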
Learning where data arrives sequentially and the model updates continuously, often under changing distributions.
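A minimal sketch of such a sequential update: a constant-step-size estimator that applies one update per arriving observation, so it keeps tracking under distribution shift (values and names are illustrative):

```python
def online_mean(stream, lr=0.05):
    """One constant-step-size update per arriving observation; the fixed
    step size lets the estimate adapt when the data distribution shifts."""
    est = 0.0
    for x in stream:
        est += lr * (x - est)  # gradient step on squared error toward x
    return est

# The stream's distribution shifts midway; the estimate tracks the new regime.
est = online_mean([1.0] * 200 + [5.0] * 200)
```

A decaying step size would converge more precisely on stationary data but would stop adapting after a shift, which is why a constant step is common in drifting streams.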
Model trained to predict human preferences (or utility) for candidate outputs; used in RLHF-style pipelines.
Ensuring model behavior matches human goals, norms, and constraints, including reducing harmful or deceptive outputs.
System design where humans validate or guide model outputs, especially for high-stakes decisions.
Combines value estimation (critic) with policy learning (actor).
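A minimal actor-critic sketch on a two-armed bandit, with the critic's value estimate serving as the actor's baseline (arm probabilities, step sizes, and names are illustrative assumptions):

```python
import math
import random

def actor_critic_bandit(p_reward=(0.2, 0.8), steps=2000,
                        lr_actor=0.1, lr_critic=0.1, seed=0):
    """Two-armed Bernoulli bandit: the critic tracks average reward,
    the actor takes a policy-gradient step weighted by the advantage."""
    rng = random.Random(seed)
    prefs = [0.0, 0.0]  # actor: softmax preferences over the two arms
    value = 0.0         # critic: running estimate of expected reward
    probs = [0.5, 0.5]
    for _ in range(steps):
        z = [math.exp(h) for h in prefs]
        probs = [e / sum(z) for e in z]
        a = 0 if rng.random() < probs[0] else 1
        r = 1.0 if rng.random() < p_reward[a] else 0.0
        advantage = r - value            # critic's estimate acts as baseline
        value += lr_critic * advantage   # critic update
        for i in (0, 1):                 # actor update (softmax policy gradient)
            grad = (1.0 - probs[i]) if i == a else -probs[i]
            prefs[i] += lr_actor * advantage * grad
    return probs

probs = actor_critic_bandit()
# The actor should concentrate probability on the better-paying arm.
```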
Coordination arising without explicit programming.
Running new model alongside production without user impact.
Incrementally deploying new models to reduce risk.
Shift in feature distribution over time.
Shift in the distribution of model outputs over time.
Willingness of a system to accept correction or shutdown.
Explicit output constraints (format, tone).
AI systems that perceive and act in the physical world through sensors and actuators.
Hardware components that execute physical actions.