Search: probabilistic accuracy

Overconfidence Intermediate

Predicted probabilities are higher than the model's actual rate of being correct.

Model Failure Modes
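
One rough way to check for overconfidence is to compare average predicted confidence with observed accuracy. The sketch below is illustrative only; the function name `confidence_gap` and the sample values are assumptions, not part of the glossary.

```python
def confidence_gap(confidences, correct):
    """Mean predicted confidence minus observed accuracy.

    A positive value suggests overconfidence, a negative one underconfidence.
    """
    if len(confidences) != len(correct) or not confidences:
        raise ValueError("inputs must be non-empty and the same length")
    mean_confidence = sum(confidences) / len(confidences)
    accuracy = sum(1 for c in correct if c) / len(correct)
    return mean_confidence - accuracy


# Hypothetical model: claims 90% confidence on every prediction, is right half the time.
gap = confidence_gap([0.9, 0.9, 0.9, 0.9], [True, True, False, False])
print(f"confidence gap: {gap:.2f}")
```

A single global gap can hide problems in particular confidence ranges, so binned measures are usually preferred in practice.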

Perception Stack Advanced

Software pipeline converting raw sensor data into structured representations.

Robotics & Embodied AI

State Estimation Advanced

Inferring the agent’s internal state from noisy sensor data.

Robotics & Embodied AI

Exteroception Advanced

External sensing of surroundings (vision, audio, lidar).

Robotics & Embodied AI

RRT Advanced

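Rapidly-exploring Random Tree: a sampling-based motion planner that grows a tree from the start configuration toward randomly sampled points.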

Motion Planning & Navigation

Latent Dynamics Frontier

Modeling environment evolution in latent space.

World Models & Cognition

Object Permanence Frontier

Understanding that objects continue to exist when they are not observed.

World Models & Cognition

Commonsense Physics Frontier

Human-like understanding of physical behavior.

World Models & Cognition

Intent Recognition Frontier

Inferring human goals from behavior.

World Models & Cognition

Natural Language Instruction Frontier

Controlling robots via language.

World Models & Cognition

Safety-Critical System Advanced

Systems whose failure can cause physical harm.

Agents & Autonomy

Case Outcome Prediction Intermediate

Predicting the probability that a legal case succeeds.

AI in Law

Automated Hypothesis Generation Advanced

AI proposing scientific hypotheses.

AI in Science

Existential Risk Advanced

Risk threatening humanity’s survival.

AI Safety & Alignment

Concept Drift Intermediate

The relationship between inputs and outputs changes over time, requiring monitoring and model updates.

Foundations & Theory

Generalization Intermediate

How well a model performs on new data drawn from the same (or similar) distribution as training.

Foundations & Theory

Data Leakage Intermediate

When information from evaluation data improperly influences training, inflating reported performance.

Foundations & Theory
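
A common source of leakage is computing preprocessing statistics on the full dataset. The sketch below, with hypothetical helper names `fit_standardizer` and `transform`, fits the statistics on the training split only and then reuses them on held-out data.

```python
def fit_standardizer(train_values):
    """Compute mean and standard deviation from the training split only."""
    mean = sum(train_values) / len(train_values)
    variance = sum((v - mean) ** 2 for v in train_values) / len(train_values)
    std = variance ** 0.5 or 1.0  # fall back to 1.0 for constant features
    return mean, std


def transform(values, mean, std):
    """Apply previously fitted statistics without recomputing them."""
    return [(v - mean) / std for v in values]


train = [1.0, 2.0, 3.0]
test = [10.0]  # held-out data never contributes to mean or std

mean, std = fit_standardizer(train)
train_scaled = transform(train, mean, std)
test_scaled = transform(test, mean, std)
```

Fitting on `train + test` instead would let the held-out value shift the mean and scale, which is exactly the improper influence the definition describes.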

PR Curve Intermediate

Plot of precision against recall across decision thresholds; often more informative than ROC on imbalanced datasets because it focuses on positive-class performance.

Evaluation & Benchmarking
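
To make the curve concrete, the following sketch computes precision and recall at a few thresholds. The function name, the sample scores, and the convention of reporting precision as 1.0 when nothing is predicted positive are all assumptions for illustration.

```python
def precision_recall_points(scores, labels, thresholds):
    """Return (threshold, precision, recall) for each threshold.

    labels are 1 for the positive class and 0 otherwise.
    """
    total_positives = sum(labels)
    points = []
    for t in thresholds:
        predicted = [s >= t for s in scores]
        tp = sum(1 for p, y in zip(predicted, labels) if p and y == 1)
        fp = sum(1 for p, y in zip(predicted, labels) if p and y == 0)
        # Assumed convention: precision is 1.0 when no positives are predicted.
        precision = tp / (tp + fp) if (tp + fp) else 1.0
        recall = tp / total_positives if total_positives else 0.0
        points.append((t, precision, recall))
    return points


points = precision_recall_points(
    scores=[0.9, 0.8, 0.4, 0.2],
    labels=[1, 0, 1, 0],
    thresholds=[0.5, 0.3],
)
for threshold, precision, recall in points:
    print(f"t={threshold}: precision={precision:.2f}, recall={recall:.2f}")
```

True negatives never enter either quantity, which is why the curve stays informative when negatives vastly outnumber positives.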

Mean Squared Error Intermediate

Average of squared residuals; common regression objective.

Optimization
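
As a small worked example, the sketch below computes the mean of the squared residuals directly. The function name and the sample values are illustrative assumptions.

```python
def mean_squared_error(y_true, y_pred):
    """Average of squared differences between targets and predictions."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and the same length")
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


# Residuals are 1, 0, and -2, so the squared residuals are 1, 0, and 4.
mse = mean_squared_error([3.0, 5.0, 2.0], [2.0, 5.0, 4.0])
print(f"MSE: {mse:.4f}")
```

Squaring makes large residuals dominate the average, so a few outliers can move this metric substantially.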

Early Stopping Intermediate

Halting training when validation performance stops improving to reduce overfitting.

Foundations & Theory
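
One common way to implement this is a patience counter over per-epoch validation losses. The sketch below assumes a hypothetical `patience` parameter and a made-up loss history; real training loops typically also restore the best checkpoint.

```python
def early_stopping_epoch(val_losses, patience=2):
    """Return the epoch index at which training would halt.

    Training stops once the validation loss has failed to improve on the
    best value seen so far for `patience` consecutive epochs.
    """
    best_loss = float("inf")
    epochs_without_improvement = 0
    for epoch, loss in enumerate(val_losses):
        if loss < best_loss:
            best_loss = loss
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                return epoch
    return len(val_losses) - 1  # never triggered: ran all epochs


# Loss improves through epoch 2, then fails to beat 0.7 twice in a row.
stop = early_stopping_epoch([1.0, 0.8, 0.7, 0.75, 0.72, 0.9], patience=2)
print(f"stopped at epoch {stop}")
```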

Chain-of-Thought Intermediate

Stepwise reasoning patterns that can improve performance on multi-step tasks; in deployed systems the reasoning is often kept internal or summarized for safety/privacy.

Foundations & Theory

RAG Intermediate

Architecture that retrieves relevant documents (e.g., from a vector DB) and conditions generation on them to reduce hallucinations.

Foundations & Theory

Chunking Intermediate

Breaking documents into pieces for retrieval; chunk size/overlap strongly affect RAG quality.

Foundations & Theory
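
The interaction between chunk size and overlap can be seen in a minimal character-based splitter. This is only a sketch: the function name is hypothetical, and production systems often split on tokens, sentences, or document structure instead.

```python
def chunk_text(text, chunk_size=20, overlap=5):
    """Split text into fixed-size character chunks that share `overlap` characters."""
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")
    step = chunk_size - overlap
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last chunk already reaches the end of the text
        start += step
    return chunks


chunks = chunk_text("abcdefghij", chunk_size=4, overlap=2)
print(chunks)
```

Larger overlap reduces the chance that a relevant passage is cut at a boundary, at the cost of storing and searching more chunks.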

SHAP Intermediate

Feature attribution method based on Shapley values from cooperative game theory; widely used to explain predictions on tabular data.

Foundations & Theory

Data Labeling Intermediate

Human or automated process of assigning targets; quality, consistency, and guidelines matter heavily.

Foundations & Theory

Class Imbalance Intermediate

When some classes are rare, requiring reweighting, resampling, or specialized metrics.

Machine Learning
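
Reweighting is often done with inverse-frequency class weights. The sketch below uses the heuristic n_samples / (n_classes * class_count); the function name and the toy labels are assumptions for illustration.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Weight each class inversely to its frequency.

    Uses n_samples / (n_classes * class_count), so rare classes get
    larger weights and a perfectly balanced dataset gets 1.0 everywhere.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# Toy dataset: nine negatives and one positive.
weights = balanced_class_weights(["neg"] * 9 + ["pos"])
print(weights)
```

These weights would typically multiply each example's loss term, so an error on the rare class costs more than an error on the common one.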

CI/CD for ML Intermediate

Automated testing and deployment processes for models and data workflows, extending DevOps to ML artifacts.

MLOps & Infrastructure

Audit Intermediate

Systematic review of model/data processes to ensure performance, fairness, security, and policy compliance.

Governance & Ethics

Benchmark Intermediate

A dataset and metric suite for comparing models; can be gamed or misaligned with real-world goals.

Evaluation & Benchmarking

Data Poisoning Intermediate

Maliciously inserting or altering training data to implant backdoors or degrade performance.

Foundations & Theory
