Results for "shared reward"

13 results

Shared Autonomy Frontier

Control shared between human and agent.

World Models & Cognition
Reward Hacking Advanced

Maximizing reward without fulfilling the intended goal.

AI Safety & Alignment
Reward Shaping Advanced

Modifying the reward signal to accelerate learning.

Reinforcement Learning
Sparse Reward Advanced

Reward given only rarely, often only upon task completion.

Reinforcement Learning
Multitask Learning Intermediate

Training one model on multiple tasks simultaneously to improve generalization through shared structure.

Machine Learning
Mutual Information Intermediate

Quantifies shared information between random variables.

AI Economics & Strategy
Blackboard System Advanced

Agents communicate via shared state.

Agents & Autonomy
Reinforcement Learning Intermediate

A learning paradigm where an agent interacts with an environment and learns to choose actions to maximize cumulative reward.

Reinforcement Learning
RLHF Intermediate

Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.

Optimization
Value Function Intermediate

Expected cumulative reward from a state or state-action pair under a given policy.

AI Economics & Strategy
Policy Gradient Intermediate

Optimizing policies directly via gradient ascent on expected reward.

AI Economics & Strategy
Inverse Reinforcement Learning Advanced

Inferring a reward function from observed behavior.

Reinforcement Learning
Reward Model Intermediate

Model trained to predict human preferences (or utility) for candidate outputs; used in RLHF-style pipelines.

Foundations & Theory