Results for "model-based"

Model-Based RL

Advanced

RL using learned or known environment models.

Model-based reinforcement learning is like having a map while exploring a new city. Instead of wandering around aimlessly, you can look at the map to plan your route and make better decisions about where to go next. In this type of learning, an AI agent first learns how the environment works—like...

AdvertisementAd space — search-top

405 results

Model Orchestration Intermediate

Coordinating models, tools, and logic.

AI Economics & Strategy
Model Predictive Control Intermediate

Optimizes future actions using a model of dynamics.

Foundations & Theory
Friction Model Advanced

Mathematical representation of friction forces.

Dynamics & Physics
Fine-Tuning Intermediate

Updating a pretrained model’s weights on task-specific data to improve performance or adapt style/behavior.

Large Language Models
Concept Drift Intermediate

The relationship between inputs and outputs changes over time, requiring monitoring and model updates.

Foundations & Theory
Generalization Intermediate

How well a model performs on new data drawn from the same (or similar) distribution as training.

Foundations & Theory
Accuracy Intermediate

Fraction of correct predictions; can be misleading on imbalanced datasets.

Foundations & Theory
PR Curve Intermediate

Often more informative than ROC on imbalanced datasets; focuses on positive class performance.

Evaluation & Benchmarking
Autoregressive Model Intermediate

Generates sequences one token at a time, conditioning on past tokens.

Foundations & Theory
Chain-of-Thought Intermediate

Stepwise reasoning patterns that can improve multi-step tasks; often handled implicitly or summarized for safety/privacy.

Foundations & Theory
Parameter-Efficient Fine-Tuning Intermediate

Techniques that fine-tune small additional components rather than all weights to reduce compute and storage.

Foundations & Theory
Audit Intermediate

Systematic review of model/data processes to ensure performance, fairness, security, and policy compliance.

Governance & Ethics
Perplexity Intermediate

Exponential of average negative log-likelihood; lower means better predictive fit, not necessarily better utility.

Evaluation & Benchmarking
Data Poisoning Intermediate

Maliciously inserting or altering training data to implant backdoors or degrade performance.

Foundations & Theory
Human-in-the-Loop Intermediate

System design where humans validate or guide model outputs, especially for high-stakes decisions.

Foundations & Theory
Parameter Sharing Intermediate

Using same parameters across different parts of a model.

AI Economics & Strategy
Multi-Head Attention Intermediate

Allows model to attend to information from different subspaces simultaneously.

AI Economics & Strategy
Audit Trail Intermediate

Logged record of model inputs, outputs, and decisions.

AI Economics & Strategy
Conditional Random Field Intermediate

Probabilistic graphical model for structured prediction.

Model Architectures
Noise Schedule Advanced

Controls amount of noise added at each diffusion step.

Diffusion & Generative Models
Structural Causal Model Advanced

Formal model linking causal mechanisms and variables.

Causal AI & Interpretability
Batch Inference Intermediate

Running predictions on large datasets periodically.

MLOps & Infrastructure
Canary Release Intermediate

Incrementally deploying new models to reduce risk.

MLOps & Infrastructure
Compute Scaling Intermediate

Increasing model capacity via compute.

AI Economics & Strategy
Data Scaling Intermediate

Increasing performance via more data.

AI Economics & Strategy
Inference Cost Intermediate

Cost to run models in production.

AI Economics & Strategy
Mesa-Optimizer Advanced

Learned subsystem that optimizes its own objective.

AI Safety & Alignment
Decomposition Prompt Intro

Breaking tasks into sub-steps.

Prompting & Instructions
Retrieval Prompt Intro

Prompt augmented with retrieved documents.

Prompting & Instructions
Prompt Sensitivity Intermediate

Small prompt changes cause large output changes.

Model Failure Modes

Welcome to AI Glossary

The free, self-building AI dictionary. Help us keep it free—click an ad once in a while!

Search

Type any question or keyword into the search bar at the top.

Browse

Tap a letter in the A–Z bar to browse terms alphabetically, or filter by domain, industry, or difficulty level.

3D WordGraph

Fly around the interactive 3D graph to explore how AI concepts connect. Click any word to read its full definition.