Results for "self-reinforcement"

AdvertisementAd space — search-top

94 results

Feedback Loop Intermediate

Using production outcomes to improve models.

MLOps & Infrastructure
Autonomous Agent Advanced

System that independently pursues goals over time.

Agents & Autonomy
Planning Horizon Advanced

Number of steps considered in planning.

Agents & Autonomy
Importance Sampling Advanced

Sampling from easier distribution with reweighting.

Probability & Statistics
Stochastic Approximation Intermediate

Optimization under uncertainty.

Foundations & Theory
Value Misalignment Advanced

Model optimizes objectives misaligned with human values.

AI Safety & Alignment
Inner Alignment Advanced

Ensuring learned behavior matches intended objective.

AI Safety & Alignment
Mesa-Optimizer Advanced

Learned subsystem that optimizes its own objective.

AI Safety & Alignment
Deceptive Alignment Advanced

Model behaves well during training but not deployment.

AI Safety & Alignment
Constraint Prompting Intro

Explicit output constraints (format, tone).

Prompting & Instructions
Controller Intermediate

Algorithm computing control actions.

Foundations & Theory
Simulation Advanced

Artificial environment for training/testing agents.

Simulation & Sim-to-Real
Domain Randomization Advanced

Randomizing simulation parameters to improve real-world transfer.

Simulation & Sim-to-Real
Hybrid Training Advanced

Combining simulation and real-world data.

Simulation & Sim-to-Real
Model-Free RL Advanced

RL without explicit dynamics model.

Reinforcement Learning
Model-Based RL Advanced

RL using learned or known environment models.

Reinforcement Learning
Trajectory Optimization Advanced

Optimizing continuous action sequences.

Reinforcement Learning
Reward Shaping Advanced

Modifying reward to accelerate learning.

Reinforcement Learning
World Model Frontier

Learned model of environment dynamics.

World Models & Cognition
Latent Dynamics Frontier

Modeling environment evolution in latent space.

World Models & Cognition
Predictive Coding Frontier

Learning by minimizing prediction error.

World Models & Cognition
Mental Simulation Frontier

Imagined future trajectories.

World Models & Cognition
Human-in-the-Loop Control Frontier

Humans assist or override autonomous behavior.

World Models & Cognition
Sensorimotor Loop Advanced

Closed loop linking sensing and acting.

Agents & Autonomy
Developmental Robotics Advanced

Robots learning via exploration and growth.

Agents & Autonomy
Scientific ML Advanced

AI applied to scientific problems.

AI in Science
Active Experimentation Advanced

AI selecting next experiments.

AI in Science
Competitive Game Advanced

Agents have opposing objectives.

Agents & Autonomy
Algorithmic Collusion Advanced

AI tacitly coordinating prices.

Agents & Autonomy
Narrow AI Frontier

AI limited to specific domains.

AGI & General Intelligence

Welcome to AI Glossary

The free, self-building AI dictionary. Help us keep it free—click an ad once in a while!

Search

Type any question or keyword into the search bar at the top.

Browse

Tap a letter in the A–Z bar to browse terms alphabetically, or filter by domain, industry, or difficulty level.

3D WordGraph

Fly around the interactive 3D graph to explore how AI concepts connect. Click any word to read its full definition.