Results for "model-based"

Model-Based RL

Advanced

RL using learned or known environment models.

Model-based reinforcement learning is like having a map while exploring a new city. Instead of wandering around aimlessly, you can look at the map to plan your route and make better decisions about where to go next. In this type of learning, an AI agent first learns how the environment works—like...

AdvertisementAd space — search-top

405 results

Specificity Intermediate

Of true negatives, the fraction correctly identified.

Foundations & Theory
AUC Intermediate

Scalar summary of ROC; measures ranking ability, not calibration.

Foundations & Theory
Mean Squared Error Intermediate

Average of squared residuals; common regression objective.

Optimization
Stochastic Gradient Descent Intermediate

A gradient method using random minibatches for efficient training on large datasets.

Foundations & Theory
Batch Size Intermediate

Number of samples per gradient update; impacts compute efficiency, generalization, and stability.

Foundations & Theory
Early Stopping Intermediate

Halting training when validation performance stops improving to reduce overfitting.

Foundations & Theory
Neural Network Intermediate

A parameterized function composed of interconnected units organized in layers with nonlinear activations.

Neural Networks
Activation Function Intermediate

Nonlinear functions enabling networks to approximate complex mappings; ReLU variants dominate modern DL.

Foundations & Theory
Chunking Intermediate

Breaking documents into pieces for retrieval; chunk size/overlap strongly affect RAG quality.

Foundations & Theory
Causal Inference Intermediate

Framework for reasoning about cause-effect relationships beyond correlation, often using structural assumptions and experiments.

Foundations & Theory
Data Governance Intermediate

Processes and controls for data quality, access, lineage, retention, and compliance across the AI lifecycle.

Foundations & Theory
Experiment Tracking Intermediate

Logging hyperparameters, code versions, data snapshots, and results to reproduce and compare experiments.

Evaluation & Benchmarking
Reproducibility Intermediate

Ability to replicate results given same code/data; harder in distributed training and nondeterministic ops.

Foundations & Theory
Latency Intermediate

Time from request to response; critical for real-time inference and UX.

Foundations & Theory
Throughput Intermediate

How many requests or tokens can be processed per unit time; affects scalability and cost.

Transformers & LLMs
Temperature Intermediate

Scales logits before sampling; higher increases randomness/diversity, lower increases determinism.

Foundations & Theory
Top-k Intermediate

Samples from the k highest-probability tokens to limit unlikely outputs.

Foundations & Theory
Benchmark Intermediate

A dataset + metric suite for comparing models; can be gamed or misaligned with real-world goals.

Evaluation & Benchmarking
Responsible AI Intermediate

A discipline ensuring AI systems are fair, safe, transparent, privacy-preserving, and accountable throughout lifecycle.

Governance & Ethics
Speech Recognition Intermediate

Converting audio speech into text, often using encoder-decoder or transducer architectures.

Speech & Audio AI
Information Gain Intermediate

Reduction in uncertainty achieved by observing a variable; used in decision trees and active learning.

AI Economics & Strategy
Computational Learning Theory Intermediate

A theoretical framework analyzing what classes of functions can be learned, how efficiently, and with what guarantees.

AI Economics & Strategy
KL Divergence Intermediate

Measures how one probability distribution diverges from another.

AI Economics & Strategy
Maximum Likelihood Estimation Intermediate

Estimating parameters by maximizing likelihood of observed data.

AI Economics & Strategy
Second-Order Methods Intermediate

Optimization using curvature information; often expensive at scale.

AI Economics & Strategy
Hessian Matrix Intermediate

Matrix of second derivatives describing local curvature of loss.

AI Economics & Strategy
Bottleneck Layer Intermediate

A narrow hidden layer forcing compact representations.

AI Economics & Strategy
Depth vs Width Intermediate

Tradeoffs between many layers vs many neurons per layer.

AI Economics & Strategy
Self-Reflection Intermediate

Models evaluating and improving their own outputs.

AI Economics & Strategy
Gradient Leakage Intermediate

Recovering training data from gradients.

AI Economics & Strategy

Welcome to AI Glossary

The free, self-building AI dictionary. Help us keep it free—click an ad once in a while!

Search

Type any question or keyword into the search bar at the top.

Browse

Tap a letter in the A–Z bar to browse terms alphabetically, or filter by domain, industry, or difficulty level.

3D WordGraph

Fly around the interactive 3D graph to explore how AI concepts connect. Click any word to read its full definition.