Results for "real-world testing"
A model that behaves well during training and evaluation but fails after deployment, often due to a shift between training and real-world data.
Using limited human feedback to guide large models.
RL using learned or known environment models.
Human-like understanding of physical behavior.
A structured collection of examples used to train/evaluate models; quality, bias, and coverage often dominate outcomes.
Separating data into training (fit), validation (tune), and test (final estimate) to avoid leakage and optimism bias.
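The three-way split above can be sketched in plain Python. This is a minimal illustration (shuffle once with a fixed seed, then cut into three disjoint partitions); the function name and fraction defaults are illustrative, not from the source.

```python
import random

def three_way_split(data, val_frac=0.15, test_frac=0.15, seed=0):
    """Shuffle once, then cut into disjoint train/validation/test partitions."""
    items = list(data)
    random.Random(seed).shuffle(items)  # fixed seed for reproducibility
    n = len(items)
    n_test = int(n * test_frac)
    n_val = int(n * val_frac)
    test = items[:n_test]                # touched only for the final estimate
    val = items[n_test:n_test + n_val]   # used for hyperparameter tuning
    train = items[n_test + n_val:]       # used for model fitting
    return train, val, test

train, val, test = three_way_split(range(100))
```

Keeping the partitions disjoint is what prevents leakage; evaluating on the test set only once avoids the optimism bias of repeated tuning against it.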
Crafting prompts to elicit desired behavior, often using role, structure, constraints, and examples.
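A prompt assembled from role, constraints, and examples can be sketched as a simple template. The section headers and function name here are illustrative conventions, not a required or standard format.

```python
def build_prompt(role, task, constraints, examples):
    """Assemble a structured prompt: role, task, constraints, few-shot examples."""
    parts = [f"You are {role}.", f"Task: {task}", "Constraints:"]
    parts += [f"- {c}" for c in constraints]
    parts.append("Examples:")
    parts += [f"Input: {i}\nOutput: {o}" for i, o in examples]
    return "\n".join(parts)

prompt = build_prompt(
    role="a terse technical editor",
    task="Fix grammar in the given sentence.",
    constraints=["Return only the corrected sentence.", "Do not change meaning."],
    examples=[("He go home.", "He goes home.")],
)
```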
Framework for identifying, measuring, and mitigating model risks.
Systematic review of model/data processes to ensure performance, fairness, security, and policy compliance.
Incrementally deploying new models to reduce risk.
Describes the probabilities of a random variable's possible outcomes.
The normalized sum of many independent, identically distributed variables with finite variance converges to a normal distribution.
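This convergence is easy to see by simulation. A minimal sketch: average many Uniform(0,1) draws, standardize each mean, and check that the result looks standard normal (the sample sizes and seed below are arbitrary choices, not from the source).

```python
import random
import statistics

def standardized_means(n_vars=50, n_samples=2000, seed=1):
    """Draw n_samples means of n_vars Uniform(0,1) variables, standardized.

    Uniform(0,1) has mean 0.5 and variance 1/12, so the mean of n_vars
    draws has mean 0.5 and standard deviation sqrt(1 / (12 * n_vars)).
    """
    rng = random.Random(seed)
    sigma = (1.0 / (12 * n_vars)) ** 0.5
    return [
        (sum(rng.random() for _ in range(n_vars)) / n_vars - 0.5) / sigma
        for _ in range(n_samples)
    ]

z = standardized_means()
mean_z = statistics.fmean(z)   # should be near 0
std_z = statistics.stdev(z)    # should be near 1
```

Roughly 68% of the standardized means should fall within one standard deviation of zero, as a standard normal predicts.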
Probability of the observed data given the parameters, viewed as a function of the parameters.
Updated belief after observing data.
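The two preceding notions combine in Bayes' rule: posterior ∝ prior × likelihood. A minimal sketch with a discrete prior over a coin's bias and a binomial likelihood (the hypotheses and counts are illustrative):

```python
def posterior(prior, heads, tails):
    """Bayes update: prior is a dict mapping bias hypothesis -> prior probability."""
    unnorm = {p: w * (p ** heads) * ((1 - p) ** tails)
              for p, w in prior.items()}     # prior times likelihood
    z = sum(unnorm.values())                 # evidence (normalizing constant)
    return {p: v / z for p, v in unnorm.items()}

# uniform prior over three bias hypotheses, then observe 8 heads, 2 tails
prior = {0.3: 1 / 3, 0.5: 1 / 3, 0.7: 1 / 3}
post = posterior(prior, heads=8, tails=2)
```

After seeing mostly heads, the posterior shifts belief toward the heads-biased hypothesis.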
Specifying goals so that optimizing them yields the intended behavior.
Oversight of how model changes are proposed, reviewed, and deployed.
Quantifying financial risk.
Risk of incorrect financial models.
Letting an LLM call external functions/APIs to fetch data, compute, or take actions, improving reliability.
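The dispatch side of tool calling can be sketched without any model in the loop: the model emits a structured call, and the host parses and executes it. The registry, tool names, and JSON shape below are hypothetical, not any particular vendor's API.

```python
import json

# hypothetical tool registry; a real system would expose these schemas
# to the model and parse its structured tool-call output
TOOLS = {
    "add": lambda a, b: a + b,
    "lookup": lambda key: {"capital_of_france": "Paris"}.get(key, "unknown"),
}

def run_tool_call(message: str):
    """Parse a JSON tool call like {"tool": ..., "args": {...}} and dispatch it."""
    call = json.loads(message)
    fn = TOOLS[call["tool"]]        # unknown tools raise KeyError
    return fn(**call["args"])

result = run_tool_call('{"tool": "add", "args": {"a": 2, "b": 3}}')
```

Routing computation and lookup through tools like these is what improves reliability: the model delegates exact arithmetic and fresh facts instead of generating them.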
Time from request to response; critical for real-time inference and UX.
Learning where data arrives sequentially and the model updates continuously, often under changing distributions.
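A continuously updating model can be sketched as per-example stochastic gradient descent: each arriving (x, y) pair triggers one weight update, with no stored dataset. The single-weight model, stream, and learning rate are illustrative assumptions.

```python
def online_sgd(stream, lr=0.1):
    """Single-weight linear model, updated after each (x, y) example arrives."""
    w = 0.0
    for x, y in stream:
        grad = (w * x - y) * x   # gradient of 0.5 * (w*x - y)^2 w.r.t. w
        w -= lr * grad           # incremental update; no batch re-fit
    return w

# simulated stream drawn from y = 2x with x in (0, 1]
stream = [((i % 10 + 1) / 10.0, 2 * ((i % 10 + 1) / 10.0)) for i in range(200)]
w = online_sgd(stream)
```

Because each update uses only the newest example, the same loop tracks a drifting target if the stream's underlying relationship changes over time.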
Low-latency prediction per request.
Two-network setup in which a generator learns to produce samples that fool a discriminator trained to distinguish real data from generated data.
Control using real-time sensor feedback.
Enables external computation or lookup.
High-fidelity virtual model of a physical system.
AI focused on interpreting images/video: classification, detection, segmentation, tracking, and 3D understanding.
Intelligence emerges from interaction with the physical world.
Acting to minimize surprise or free energy.
Running AI systems in isolated, restricted environments to limit their access and side effects.