Results for "model-based"

Model-Based RL

Advanced

RL using learned or known environment models.

Model-based reinforcement learning is like having a map while exploring a new city. Instead of wandering around aimlessly, you can look at the map to plan your route and make better decisions about where to go next. In this type of learning, an AI agent first learns how the environment works—like...

AdvertisementAd space — search-top

405 results

Agent Intermediate

A system that perceives state, selects actions, and pursues goals—often combining LLM reasoning with tools and memory.

Agents & Autonomy
Artificial Intelligence Intermediate

The field of building systems that perform tasks associated with human intelligence—perception, reasoning, language, planning, and decision-making—via algori...

Foundations & Theory
Strategic Interaction Advanced

Decisions dependent on others’ actions.

Agents & Autonomy
Shadow Deployment Intermediate

Running new model alongside production without user impact.

MLOps & Infrastructure
LIME Intermediate

Local surrogate explanation method approximating model behavior near a specific input.

Foundations & Theory
Model Card Intermediate

Standardized documentation describing intended use, performance, limitations, data, and ethical considerations.

Foundations & Theory
Model Inversion Intermediate

Inferring sensitive features of training data.

AI Economics & Strategy
Model Documentation Intermediate

Required descriptions of model behavior and limits.

Governance & Ethics
Model Intermediate

A parameterized mapping from inputs to outputs; includes architecture + learned parameters.

Foundations & Theory
Underfitting Intermediate

When a model cannot capture underlying structure, performing poorly on both training and test data.

Foundations & Theory
Model Governance Intermediate

Policies and practices for approving, monitoring, auditing, and documenting models in production.

Governance & Ethics
Model Risk Management Intermediate

Framework for identifying, measuring, and mitigating model risks.

AI Economics & Strategy
Model Moat Intermediate

Competitive advantage from proprietary models/data.

AI Economics & Strategy
Distillation Intermediate

Training a smaller “student” model to mimic a larger “teacher,” often improving efficiency while retaining performance.

Foundations & Theory
Parameters Intermediate

The learned numeric values of a model adjusted during training to minimize a loss function.

Foundations & Theory
Vocabulary Intermediate

The set of tokens a model can represent; impacts efficiency, multilinguality, and handling of rare strings.

Transformers & LLMs
Training Pipeline Intermediate

End-to-end process for model training.

MLOps & Infrastructure
Hallucination Intermediate

Model-generated content that is fluent but unsupported by evidence or incorrect; mitigated by grounding and verification.

Model Failure Modes
Overgeneralization Intermediate

Applying learned patterns incorrectly.

Model Failure Modes
Model Inventory Intermediate

Central catalog of deployed and experimental models.

AI Economics & Strategy
Feedback Loop Collapse Intermediate

Model trained on its own outputs degrades quality.

Model Failure Modes
Overfitting Intermediate

When a model fits noise/idiosyncrasies of training data and performs poorly on unseen data.

Foundations & Theory
Train/Validation/Test Split Intermediate

Separating data into training (fit), validation (tune), and test (final estimate) to avoid leakage and optimism bias.

Evaluation & Benchmarking
Cross-Validation Intermediate

A robust evaluation technique that trains/evaluates across multiple splits to estimate performance variability.

Foundations & Theory
System Prompt Intermediate

A high-priority instruction layer setting overarching behavior constraints for a chat model.

Reinforcement Learning
Interpretability Intermediate

Studying internal mechanisms or input influence on outputs (e.g., saliency maps, SHAP, attention analysis).

Foundations & Theory
Federated Learning Intermediate

Training across many devices/silos without centralizing raw data; aggregates updates, not data.

Foundations & Theory
Eval Harness Intermediate

System for running consistent evaluations across tasks, versions, prompts, and model settings.

Foundations & Theory
Variance Term Intermediate

Error due to sensitivity to fluctuations in the training dataset.

AI Economics & Strategy
Backdoor / Trojan Intermediate

Hidden behavior activated by specific triggers, causing targeted mispredictions or undesired outputs.

Foundations & Theory

Welcome to AI Glossary

The free, self-building AI dictionary. Help us keep it free—click an ad once in a while!

Search

Type any question or keyword into the search bar at the top.

Browse

Tap a letter in the A–Z bar to browse terms alphabetically, or filter by domain, industry, or difficulty level.

3D WordGraph

Fly around the interactive 3D graph to explore how AI concepts connect. Click any word to read its full definition.