Search: shared reward — AI Glossary

Shared Autonomy Frontier

Control shared between human and agent.

World Models & Cognition

Reward Hacking Advanced

Maximizing reward without fulfilling real goal.

AI Safety & Alignment

Reward Shaping Advanced

Modifying reward to accelerate learning.

Reinforcement Learning

Sparse Reward Advanced

Reward only given upon task completion.

Reinforcement Learning

Multitask Learning Intermediate

Training one model on multiple tasks simultaneously to improve generalization through shared structure.

Machine Learning

Mutual Information Intermediate

Quantifies shared information between random variables.

AI Economics & Strategy

Blackboard System Advanced

Agents communicate via shared state.

Agents & Autonomy

Reinforcement Learning Intermediate

A learning paradigm where an agent interacts with an environment and learns to choose actions to maximize cumulative reward.

Reinforcement Learning

RLHF Intermediate

Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.

Optimization

Value Function Intermediate

Expected cumulative reward from a state or state-action pair.

AI Economics & Strategy

Policy Gradient Intermediate

Optimizing policies directly via gradient ascent on expected reward.

AI Economics & Strategy

Inverse Reinforcement Learning Advanced

Inferring reward function from observed behavior.

Reinforcement Learning

Reward Model Intermediate

Model trained to predict human preferences (or utility) for candidate outputs; used in RLHF-style pipelines.

Foundations & Theory

Results for "shared reward"