Results for "model-based"

Model-Based RL

Advanced

RL using learned or known environment models.

Model-based reinforcement learning is like having a map while exploring a new city. Instead of wandering around aimlessly, you can look at the map to plan your route and make better decisions about where to go next. In this type of learning, an AI agent first learns how the environment works—like...

AdvertisementAd space — search-top

405 results

Expressivity Intermediate

The range of functions a model can represent.

AI Economics & Strategy
Model Watermarking Intermediate

Embedding signals to prove model ownership.

AI Economics & Strategy
Hidden Markov Model Intermediate

Probabilistic model for sequential data with latent states.

Model Architectures
Diffusion Model Advanced

Generative model that learns to reverse a gradual noise process.

Diffusion & Generative Models
Acoustic Model Intermediate

Maps audio signals to linguistic units.

Speech & Audio AI
Prediction Drift Intermediate

Shift in model outputs.

MLOps & Infrastructure
Training Cost Intermediate

Cost of model training.

AI Economics & Strategy
Open-Weight Model Intermediate

Models whose weights are publicly available.

AI Economics & Strategy
Closed Model Intermediate

Models accessible only via service APIs.

AI Economics & Strategy
Zero-Shot Prompting Intro

Task instruction without examples.

Prompting & Instructions
One-Shot Prompting Intro

One example included to guide output.

Prompting & Instructions
Model Disclosure Intermediate

Requirement to reveal AI usage in legal decisions.

AI in Law
Multitask Learning Intermediate

Training one model on multiple tasks simultaneously to improve generalization through shared structure.

Machine Learning
Objective Function Intermediate

A scalar measure optimized during training, typically expected loss over data, sometimes with regularization terms.

Optimization
Grounding Intermediate

Constraining outputs to retrieved or provided sources, often with citation, to improve factual reliability.

Foundations & Theory
Data Leakage Intermediate

When information from evaluation data improperly influences training, inflating reported performance.

Foundations & Theory
SFT Intermediate

Fine-tuning on (prompt, response) pairs to align a model with instruction-following behaviors.

Foundations & Theory
Language Model Intermediate

A model that assigns probabilities to sequences of tokens; often trained by next-token prediction.

Large Language Models
Alignment Intermediate

Ensuring model behavior matches human goals, norms, and constraints, including reducing harmful or deceptive outputs.

Foundations & Theory
MLOps Intermediate

Practices for operationalizing ML: versioning, CI/CD, monitoring, retraining, and reliable production management.

MLOps & Infrastructure
CI/CD for ML Intermediate

Automated testing and deployment processes for models and data workflows, extending DevOps to ML artifacts.

MLOps & Infrastructure
Monitoring Intermediate

Observing model inputs/outputs, latency, cost, and quality over time to catch regressions and drift.

MLOps & Infrastructure
Adversarial Example Intermediate

Inputs crafted to cause model errors or unsafe behavior, often imperceptible in vision or subtle in text.

Foundations & Theory
Prompt Injection Intermediate

Attacks that manipulate model instructions (especially via retrieved content) to override system goals or exfiltrate data.

Foundations & Theory
Loss Landscape Intermediate

The shape of the loss function over parameter space.

AI Economics & Strategy
Emergent Abilities Intermediate

Capabilities that appear only beyond certain model sizes.

AI Economics & Strategy
ARIMA Intermediate

Classical statistical time-series model.

Time Series
Inference Pipeline Intermediate

Model execution path in production.

MLOps & Infrastructure
Distribution Shift Intermediate

Train/test environment mismatch.

Model Failure Modes
Deceptive Alignment Advanced

Model behaves well during training but not deployment.

AI Safety & Alignment

Welcome to AI Glossary

The free, self-building AI dictionary. Help us keep it free—click an ad once in a while!

Search

Type any question or keyword into the search bar at the top.

Browse

Tap a letter in the A–Z bar to browse terms alphabetically, or filter by domain, industry, or difficulty level.

3D WordGraph

Fly around the interactive 3D graph to explore how AI concepts connect. Click any word to read its full definition.