Results for "shared reward"
Maximizing reward without fulfilling the real goal.
Modifying reward to accelerate learning.
Inferring reward function from observed behavior.
Model trained to predict human preferences (or utility) for candidate outputs; used in RLHF-style pipelines.
Reward only given upon task completion.
Training one model on multiple tasks simultaneously to improve generalization through shared structure.
Agents communicate via shared state.
Control shared between human and agent.
Designing AI to cooperate with humans and each other.
Model exploits poorly specified objectives.
Expected cumulative reward from a state or state-action pair.
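In standard notation, with policy $\pi$ and discount factor $\gamma$, the state-value and action-value functions are:

```latex
V^\pi(s) = \mathbb{E}_\pi\!\left[\sum_{t=0}^{\infty} \gamma^t r_t \,\middle|\, s_0 = s\right],
\qquad
Q^\pi(s,a) = \mathbb{E}_\pi\!\left[\sum_{t=0}^{\infty} \gamma^t r_t \,\middle|\, s_0 = s,\, a_0 = a\right]
```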
A learning paradigm where an agent interacts with an environment and learns to choose actions to maximize cumulative reward.
Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.
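One common sketch of the reward-modeling step, assuming the Bradley–Terry preference model typically used here: given a prompt $x$ with preferred and rejected responses $y_w, y_l$, the reward model $r_\phi$ is trained by minimizing

```latex
\mathcal{L}(\phi) = -\,\mathbb{E}_{(x,\, y_w,\, y_l)}\left[\log \sigma\big(r_\phi(x, y_w) - r_\phi(x, y_l)\big)\right]
```

where $\sigma$ is the logistic function; the policy is then optimized against $r_\phi$.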
Model optimizes objectives misaligned with human values.
Rules and controls around generation (filters, validators, structured outputs) to reduce unsafe or invalid behavior.
Correctly specifying goals so that optimizing them yields the intended behavior.
Optimizing policies directly via gradient ascent on expected reward.
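A minimal sketch of this idea: REINFORCE (the score-function gradient estimator) on a hypothetical two-armed bandit. Function names and hyperparameters are illustrative, not from any particular library.

```python
import numpy as np

def softmax(logits):
    z = logits - logits.max()
    p = np.exp(z)
    return p / p.sum()

def reinforce_bandit(rewards=(0.0, 1.0), steps=500, lr=0.1, seed=0):
    """REINFORCE on a toy 2-armed bandit: ascend E[reward] using the
    score-function gradient  grad log pi(a) * r."""
    rng = np.random.default_rng(seed)
    theta = np.zeros(2)              # one logit per arm
    for _ in range(steps):
        p = softmax(theta)
        a = rng.choice(2, p=p)
        r = rewards[a]
        grad_log_pi = -p             # d/dtheta_j log softmax(theta)[a] = 1[j=a] - p_j
        grad_log_pi[a] += 1.0
        theta += lr * grad_log_pi * r   # gradient ascent on expected reward
    return softmax(theta)

p = reinforce_bandit()
```

With these settings the policy concentrates almost all probability on the higher-reward arm.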
Quantifies shared information between random variables.
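As a sketch, mutual information for discrete variables can be computed directly from a joint probability table (the function name is illustrative):

```python
import numpy as np

def mutual_information(joint):
    """I(X;Y) = sum_{x,y} p(x,y) * log[ p(x,y) / (p(x) p(y)) ], in nats."""
    joint = np.asarray(joint, dtype=float)
    joint = joint / joint.sum()                 # normalize to a distribution
    px = joint.sum(axis=1, keepdims=True)       # marginal over rows
    py = joint.sum(axis=0, keepdims=True)       # marginal over columns
    mask = joint > 0                            # 0 * log 0 treated as 0
    return float((joint[mask] * np.log(joint[mask] / (px @ py)[mask])).sum())
```

Independent variables give 0; a perfectly correlated pair of fair bits gives log 2 nats.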
GNN using attention to weight neighbor contributions dynamically.
Joint vision-language model aligning images and text.
Decomposing goals into sub-tasks.
Humans assist or override autonomous behavior.
Human controlling robot remotely.
Agents optimize collective outcomes.
Emergence of conventions among agents.
A preference-based training method optimizing policies directly from pairwise comparisons without explicit RL loops.
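For reference, the DPO objective on preference pairs ($y_w$ preferred over $y_l$), with temperature $\beta$ and frozen reference policy $\pi_{\mathrm{ref}}$:

```latex
\mathcal{L}_{\mathrm{DPO}}(\theta) = -\,\mathbb{E}_{(x,\, y_w,\, y_l)}\left[\log \sigma\!\left(\beta \log \frac{\pi_\theta(y_w \mid x)}{\pi_{\mathrm{ref}}(y_w \mid x)} - \beta \log \frac{\pi_\theta(y_l \mid x)}{\pi_{\mathrm{ref}}(y_l \mid x)}\right)\right]
```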
Formal framework for sequential decision-making under uncertainty.
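In standard notation, such a process is a tuple of states, actions, transition dynamics, rewards, and a discount factor:

```latex
\mathcal{M} = (\mathcal{S}, \mathcal{A}, P, R, \gamma), \qquad P(s' \mid s, a), \quad R(s, a), \quad \gamma \in [0, 1)
```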
Strategy mapping states to actions.
Expected return of taking an action in a given state.
Fundamental recursive relationship defining optimal value functions.
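A minimal sketch of how this recursion is used in practice: value iteration repeatedly applies the Bellman optimality operator until it reaches its fixed point $V^*$. The toy MDP below is hypothetical.

```python
import numpy as np

def value_iteration(P, R, gamma=0.9, tol=1e-10):
    """Solve V*(s) = max_a [ R(s,a) + gamma * sum_s' P[a,s,s'] V*(s') ]
    by fixed-point iteration on the Bellman optimality operator."""
    V = np.zeros(P.shape[1])
    while True:
        Q = R.T + gamma * (P @ V)     # Q[a, s] = R[s, a] + gamma * E[V(s') | s, a]
        V_new = Q.max(axis=0)         # greedy backup over actions
        if np.max(np.abs(V_new - V)) < tol:
            return V_new
        V = V_new

# Hypothetical toy MDP: action 0 stays put, action 1 switches state;
# occupying state 1 yields reward 1 regardless of action.
P = np.array([[[1., 0.], [0., 1.]],   # action 0: stay
              [[0., 1.], [1., 0.]]])  # action 1: switch
R = np.array([[0., 0.],               # state 0: no reward
              [1., 1.]])              # state 1: reward 1
V = value_iteration(P, R)
```

Here $V^*(1) = 1/(1-\gamma) = 10$ (stay in the rewarding state) and $V^*(0) = \gamma \cdot V^*(1) = 9$ (switch immediately), which the iteration recovers.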