Results for "model-based"
Model-Based RL
Advanced
RL using learned or known environment models.
Model-based reinforcement learning is like having a map while exploring a new city. Instead of wandering around aimlessly, you can look at the map to plan your route and make better decisions about where to go next. In this type of learning, an AI agent first learns how the environment works—like...
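The map analogy above can be sketched in code: the agent first learns a transition model from interaction, then plans its route over that learned model instead of the real environment. This is a minimal illustrative sketch; the corridor environment, reward of 1 at the rightmost state, and value-iteration planner are all invented for the example.

```python
# Model-based RL sketch: (1) learn a model of the environment from
# experience, (2) plan over the learned model with value iteration.
import random

N = 4                      # 4 states in a corridor; state 3 is the goal
ACTIONS = [-1, +1]         # move left / move right

def step(s, a):
    """The real environment: deterministic move, reward 1 at the goal."""
    s2 = max(0, min(N - 1, s + a))
    return s2, (1.0 if s2 == N - 1 else 0.0)

# 1) Learn a model from random interaction (here: a lookup table).
model = {}
random.seed(0)
while len(model) < N * len(ACTIONS):      # explore until every (s, a) is seen
    s = random.randrange(N)
    a = random.choice(ACTIONS)
    model[(s, a)] = step(s, a)            # deterministic env: one sample suffices

# 2) Plan with value iteration over the *learned* model, not the env.
gamma, V = 0.9, [0.0] * N
for _ in range(50):
    V = [max(model[(s, a)][1] + gamma * V[model[(s, a)][0]]
             for a in ACTIONS) for s in range(N)]

policy = [max(ACTIONS, key=lambda a: model[(s, a)][1] + gamma * V[model[(s, a)][0]])
          for s in range(N)]
print(policy)              # the planned route: keep moving right toward the goal
```

The key point is that planning in step (2) never touches `step` directly; once the model is accurate, the agent can "look at the map" as often as it likes for free.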
The range of functions a model can represent.
Embedding signals to prove model ownership.
Probabilistic model for sequential data with latent states.
Generative model that learns to reverse a gradual noise process.
Maps audio signals to linguistic units.
Shift in model outputs.
Cost of model training.
Models whose weights are publicly available.
Models accessible only via service APIs.
Task instruction without examples.
One example included to guide output.
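The two prompting styles above differ only in whether a worked example precedes the task. A minimal sketch, with a made-up sentiment-classification task:

```python
# Zero-shot: the instruction alone, no examples.
zero_shot = "Classify the sentiment of: 'I loved this film.'"

# One-shot: the same instruction plus a single guiding example.
one_shot = (
    "Classify the sentiment.\n"
    "Review: 'Terrible plot.' -> negative\n"   # the one example
    "Review: 'I loved this film.' ->"
)
print(zero_shot)
print(one_shot)
```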
Requirement to reveal AI usage in legal decisions.
Training one model on multiple tasks simultaneously to improve generalization through shared structure.
A scalar measure optimized during training, typically expected loss over data, sometimes with regularization terms.
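The definition above (expected loss plus an optional regularization term) can be made concrete with a tiny sketch; the linear model, data, and L2 penalty weight are invented for illustration.

```python
# Training objective = mean loss over the data + regularization term.
weights = [0.5, -1.0]
data = [((1.0, 2.0), 0.0), ((2.0, 0.0), 1.0)]  # (features, target) pairs

def predict(x, w):
    return sum(xi * wi for xi, wi in zip(x, w))

def objective(w, lam=0.1):
    mse = sum((predict(x, w) - y) ** 2 for x, y in data) / len(data)  # expected loss
    l2 = lam * sum(wi ** 2 for wi in w)                               # regularizer
    return mse + l2                                                   # the scalar optimized

print(objective(weights))  # -> 1.25 for these made-up numbers
```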
Constraining outputs to retrieved or provided sources, often with citation, to improve factual reliability.
When information from evaluation data improperly influences training, inflating reported performance.
Fine-tuning on (prompt, response) pairs to align a model with instruction-following behaviors.
A model that assigns probabilities to sequences of tokens; often trained by next-token prediction.
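"Assigns probabilities to sequences of tokens" can be shown with the simplest possible case, a bigram model over a toy corpus (the corpus is made up); real language models replace the count table with a neural network but score sequences the same way, one next-token probability at a time.

```python
# Bigram language model: P(sequence) = product of P(next token | previous token).
from collections import Counter

corpus = "the cat sat on the mat the cat ran".split()
pairs = Counter(zip(corpus, corpus[1:]))   # counts of (prev, next) token pairs
unigrams = Counter(corpus[:-1])            # counts of each prev token

def p_next(token, prev):
    """Conditional next-token probability, estimated from counts."""
    return pairs[(prev, token)] / unigrams[prev]

def p_sequence(tokens):
    p = 1.0
    for prev, tok in zip(tokens, tokens[1:]):
        p *= p_next(tok, prev)
    return p

print(p_sequence(["the", "cat", "sat"]))   # product of the two bigram probabilities
```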
Ensuring model behavior matches human goals, norms, and constraints, including reducing harmful or deceptive outputs.
Practices for operationalizing ML: versioning, CI/CD, monitoring, retraining, and reliable production management.
Automated testing and deployment processes for models and data workflows, extending DevOps to ML artifacts.
Observing model inputs/outputs, latency, cost, and quality over time to catch regressions and drift.
Inputs crafted to cause model errors or unsafe behavior, often imperceptible in vision or subtle in text.
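The "imperceptible" part of the definition above is easiest to see on a toy linear classifier (all numbers invented): nudging every feature by a small step against the weight signs (the fast-gradient-sign idea) flips the prediction even though the input barely moves.

```python
# Adversarial input sketch on a toy linear classifier.
w = [2.0, -3.0, 1.5]                 # classifier: predict +1 if score > 0
x = [0.5, 0.2, 0.1]                  # clean input, scored positive

def score(v):
    return sum(wi * vi for wi, vi in zip(w, v))

eps = 0.1                            # perturbation budget per feature
sign = lambda z: (z > 0) - (z < 0)
x_adv = [xi - eps * sign(wi) for xi, wi in zip(x, w)]  # worst-case small step

print(score(x), score(x_adv))        # each feature moved by only 0.1, sign flips
```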
Attacks that manipulate model instructions (especially via retrieved content) to override system goals or exfiltrate data.
The shape of the loss function over parameter space.
Capabilities that appear only beyond certain model sizes.
Classical statistical time-series model.
Model execution path in production.
Train/test environment mismatch.
Model behaves well during training but not deployment.