Results for "Shapley values"
Feature attribution method grounded in cooperative game theory for explaining predictions in tabular settings.
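The cooperative-game formulation above can be sketched directly: a feature's Shapley value is its marginal contribution averaged over all coalitions of the other features. Below is a minimal exact computation, assuming a hypothetical linear model and a baseline vector used to "remove" absent features (both are illustrative choices, not part of any particular library's API).

```python
from itertools import combinations
from math import factorial

# Hypothetical toy model: f(x) = 2*x0 + 1*x1 + 0.5*x2
def model(x):
    return 2.0 * x[0] + 1.0 * x[1] + 0.5 * x[2]

def coalition_value(x, baseline, subset):
    # Features outside the coalition are replaced by baseline values
    # (one common, assumed convention for "absent" features).
    z = [x[i] if i in subset else baseline[i] for i in range(len(x))]
    return model(z)

def shapley_values(x, baseline):
    n = len(x)
    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for k in range(len(others) + 1):
            for S in combinations(others, k):
                # Shapley weight |S|! (n - |S| - 1)! / n!
                w = factorial(len(S)) * factorial(n - len(S) - 1) / factorial(n)
                marginal = (coalition_value(x, baseline, set(S) | {i})
                            - coalition_value(x, baseline, set(S)))
                phi[i] += w * marginal
    return phi

x = [1.0, 1.0, 1.0]
baseline = [0.0, 0.0, 0.0]
phi = shapley_values(x, baseline)
# For a linear model with a zero baseline, each phi_i recovers coef_i * x_i,
# and the attributions sum to f(x) - f(baseline) (the efficiency property).
```

This exact enumeration is exponential in the number of features; practical tools (e.g. SHAP) approximate it by sampling.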
Studying internal mechanisms or input influence on outputs (e.g., saliency maps, SHAP, attention analysis).
Legal or policy requirement to explain AI decisions.
Credit models with interpretable logic.
Requirement to provide explanations.
Agents cooperate to optimize collective outcomes.
Model optimizes objectives misaligned with human values.
Variable whose values depend on chance.
The learned numeric values of a model adjusted during training to minimize a loss function.
A function measuring prediction error (and sometimes calibration), guiding gradient-based optimization.
Scalar summary of ROC; measures ranking ability, not calibration.
Average of squared residuals; common regression objective.
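As a concrete check of the definition, a minimal mean-squared-error computation (illustrative helper name, not from any library):

```python
def mse(y_true, y_pred):
    # Average of squared residuals (y_true - y_pred)^2
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

score = mse([1.0, 2.0, 3.0], [1.5, 2.0, 2.0])
# (0.25 + 0.0 + 1.0) / 3
```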
Attention where queries/keys/values come from the same sequence, enabling token-to-token interactions.
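A minimal single-head sketch of that mechanism, assuming illustrative shapes (4 tokens, model dimension 8) and randomly initialized projection matrices; real implementations add multiple heads, masking, and learned parameters.

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    # Queries, keys, and values all derive from the same sequence X.
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                   # token-to-token similarity
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)    # softmax over keys
    return weights @ V                                # context-aware mixture of values

rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))                           # 4 tokens, dim 8 (toy sizes)
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
out = self_attention(X, Wq, Wk, Wv)                   # shape (4, 8)
```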
Predicting future values from past observations.
Expected return of taking an action in a given state.
Ensuring AI systems pursue intended human goals.
Probability of data given parameters.
Correctly specifying goals.
Inferring and aligning with human preferences.
A measurable property or attribute used as model input (raw or engineered), such as age, pixel intensity, or token ID.
A scalar measure optimized during training, typically expected loss over data, sometimes with regularization terms.
Often more informative than ROC on imbalanced datasets; focuses on positive class performance.
Plots true positive rate vs false positive rate across thresholds; summarizes separability.
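AUC can be computed without tracing the curve: it equals the probability that a randomly chosen positive is ranked above a randomly chosen negative (ties counted as half). A minimal sketch using that rank interpretation (quadratic in the number of examples; illustrative function name):

```python
def roc_auc(y_true, scores):
    # Probability that a random positive outscores a random negative.
    wins = total = 0.0
    for yp, sp in zip(y_true, scores):
        for yn, sn in zip(y_true, scores):
            if yp == 1 and yn == 0:
                total += 1
                if sp > sn:
                    wins += 1
                elif sp == sn:
                    wins += 0.5          # ties count half
    return wins / total

auc = roc_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
# 3 of the 4 positive/negative pairs are correctly ranked -> 0.75
```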
A proper scoring rule measuring squared error of predicted probabilities for binary outcomes.
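The Brier score is just the mean squared error of predicted probabilities against 0/1 outcomes; a minimal sketch (illustrative helper name):

```python
def brier_score(y_true, p_pred):
    # Mean squared error of probabilities vs binary outcomes; lower is better.
    return sum((p - y) ** 2 for y, p in zip(y_true, p_pred)) / len(y_true)

bs = brier_score([1, 0, 1], [0.9, 0.2, 0.6])
# (0.01 + 0.04 + 0.16) / 3 = 0.07
```

Unlike AUC, it is sensitive to calibration: a model that ranks well but reports miscalibrated probabilities is penalized.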
Nonlinear functions enabling networks to approximate complex mappings; ReLU variants dominate modern deep learning.
Methods to set starting weights to preserve signal/gradient scales across layers.
Architecture based on self-attention and feedforward layers; foundation of modern LLMs and many multimodal models.
Mechanism that computes context-aware mixtures of representations; scales well and captures long-range dependencies.
Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.
Ensuring model behavior matches human goals, norms, and constraints, including reducing harmful or deceptive outputs.