Results for "shared reward"

AdvertisementAd space — search-top

38 results

Reward Hacking Advanced

Maximizing reward without fulfilling real goal.

AI Safety & Alignment
Reward Shaping Advanced

Modifying reward to accelerate learning.

Reinforcement Learning
Inverse Reinforcement Learning Advanced

Inferring reward function from observed behavior.

Reinforcement Learning
Reward Model Intermediate

Model trained to predict human preferences (or utility) for candidate outputs; used in RLHF-style pipelines.

Foundations & Theory
Sparse Reward Advanced

Reward only given upon task completion.

Reinforcement Learning
Multitask Learning Intermediate

Training one model on multiple tasks simultaneously to improve generalization through shared structure.

Machine Learning
Blackboard System Advanced

Agents communicate via shared state.

Agents & Autonomy
Shared Autonomy Frontier

Control shared between human and agent.

World Models & Cognition
Cooperative AI Intermediate

Designing AI to cooperate with humans and each other.

Governance & Ethics
Specification Gaming Advanced

Model exploits poorly specified objectives.

AI Safety & Alignment
Value Function Intermediate

Expected cumulative reward from a state or state-action pair.

AI Economics & Strategy
Reinforcement Learning Intermediate

A learning paradigm where an agent interacts with an environment and learns to choose actions to maximize cumulative reward.

Reinforcement Learning
RLHF Intermediate

Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.

Optimization
Value Misalignment Advanced

Model optimizes objectives misaligned with human values.

AI Safety & Alignment
Guardrails Intermediate

Rules and controls around generation (filters, validators, structured outputs) to reduce unsafe or invalid behavior.

Reinforcement Learning
Outer Alignment Advanced

Correctly specifying goals.

AI Safety & Alignment
Policy Gradient Intermediate

Optimizing policies directly via gradient ascent on expected reward.

AI Economics & Strategy
Mutual Information Intermediate

Quantifies shared information between random variables.

AI Economics & Strategy
Graph Attention Network Intermediate

GNN using attention to weight neighbor contributions dynamically.

Model Architectures
CLIP Intermediate

Joint vision-language model aligning images and text.

Computer Vision
Hierarchical Planning Advanced

Decomposing goals into sub-tasks.

Agents & Autonomy
Human-in-the-Loop Control Frontier

Humans assist or override autonomous behavior.

World Models & Cognition
Teleoperation Frontier

Human controlling robot remotely.

World Models & Cognition
Cooperative Game Advanced

Agents optimize collective outcomes.

Agents & Autonomy
Norm Formation Advanced

Emergence of conventions among agents.

Dynamics & Physics
DPO Intermediate

A preference-based training method optimizing policies directly from pairwise comparisons without explicit RL loops.

Optimization
Markov Decision Process Intermediate

Formal framework for sequential decision-making under uncertainty.

AI Economics & Strategy
Policy Intermediate

Strategy mapping states to actions.

AI Economics & Strategy
Q-Function Intermediate

Expected return of taking action in a state.

AI Economics & Strategy
Bellman Equation Intermediate

Fundamental recursive relationship defining optimal value functions.

AI Economics & Strategy

Welcome to AI Glossary

The free, self-building AI dictionary. Help us keep it free—click an ad once in a while!

Search

Type any question or keyword into the search bar at the top.

Browse

Tap a letter in the A–Z bar to browse terms alphabetically, or filter by domain, industry, or difficulty level.

3D WordGraph

Fly around the interactive 3D graph to explore how AI concepts connect. Click any word to read its full definition.