Search: reward inference

Exploration-Exploitation Tradeoff Intermediate

Balancing learning new behaviors vs exploiting known rewards.

AI Economics & Strategy

Alignment Problem Advanced

Ensuring AI systems pursue intended human goals.

AI Safety & Alignment

Inner Alignment Advanced

Ensuring learned behavior matches intended objective.

AI Safety & Alignment

Imitation Learning Advanced

Learning policies from expert demonstrations.

Reinforcement Learning

Deceptive Alignment Advanced

Model behaves well during training but not deployment.

AI Safety & Alignment

Power-Seeking Behavior Advanced

Tendency to gain control/resources.

AI Safety & Alignment

Confounding Intermediate

A hidden variable influences both cause and effect, biasing naive estimates of causal impact.

Foundations & Theory

Latency Intermediate

Time from request to response; critical for real-time inference and UX.

Foundations & Theory

Quantization Intermediate

Reducing numeric precision of weights/activations to speed inference and reduce memory with acceptable accuracy loss.

Foundations & Theory

Bayesian Inference Intermediate

Updating beliefs about parameters using observed evidence and prior distributions.

AI Economics & Strategy

Causal Mask Intermediate

Prevents attention to future tokens during training/inference.

AI Economics & Strategy

Variational Autoencoder Advanced

Autoencoder using probabilistic latent variables and KL regularization.

Diffusion & Generative Models

Instrumental Variable Advanced

Variable enabling causal inference despite confounding.

Causal AI & Interpretability

Token Budgeting Intermediate

Limiting inference usage.

AI Economics & Strategy

Likelihood Function Advanced

Probability of data given parameters.

Probability & Statistics

Tokenization Intermediate

Converting text into discrete units (tokens) for modeling; subword tokenizers balance vocabulary size and coverage.

Foundations & Theory

A/B Testing Intermediate

Controlled experiment comparing variants by random assignment to estimate causal effects of changes.

Foundations & Theory

Active Learning Intermediate

Selecting the most informative samples to label (e.g., uncertainty sampling) to reduce labeling cost.

Foundations & Theory

Observability Intermediate

A broader capability to infer internal system state from telemetry, crucial for AI services and agents.

Evaluation & Benchmarking

Throughput Intermediate

How many requests or tokens can be processed per unit time; affects scalability and cost.

Transformers & LLMs

Privacy Attack Intermediate

Attacks that infer whether specific records were in training data, or reconstruct sensitive training examples.

Foundations & Theory

Human-in-the-Loop Intermediate

System design where humans validate or guide model outputs, especially for high-stakes decisions.

Foundations & Theory

KL Divergence Intermediate

Measures how one probability distribution diverges from another.

AI Economics & Strategy

Key-Value Cache Intermediate

Stores past attention states to speed up autoregressive decoding.

AI Economics & Strategy

Maximum Likelihood Estimation Intermediate

Estimating parameters by maximizing likelihood of observed data.

AI Economics & Strategy

Algorithmic Accountability Intermediate

Ensuring decisions can be explained and traced.

AI Economics & Strategy

Gradient Leakage Intermediate

Recovering training data from gradients.

AI Economics & Strategy

Model Inversion Intermediate

Inferring sensitive features of training data.

AI Economics & Strategy

Conditional Random Field Intermediate

Probabilistic graphical model for structured prediction.

Model Architectures

Hidden Markov Model Intermediate

Probabilistic model for sequential data with latent states.

Model Architectures

Results for "reward inference"

Welcome to AI Glossary

Search

Browse

3D WordGraph