Model-Based RL
Reinforcement learning using learned or known environment models.
Model-based reinforcement learning is like having a map while exploring a new city. Instead of wandering around aimlessly, you can look at the map to plan your route and make better decisions about where to go next. In this type of learning, an AI agent first learns how the environment works—like...
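The "map" idea can be sketched with the simplest model-based method: planning by value iteration over a known transition model. The two-state MDP below, its transition probabilities, and its rewards are all made up for illustration.

```python
# Value iteration on a tiny hand-made MDP (hypothetical example).
# P[s][a] is a list of (probability, next_state, reward) outcomes.
P = {
    0: {0: [(1.0, 0, 0.0)], 1: [(0.8, 1, 1.0), (0.2, 0, 0.0)]},
    1: {0: [(1.0, 0, 0.0)], 1: [(1.0, 1, 2.0)]},
}
gamma = 0.9  # discount factor

def value_iteration(P, gamma, iters=200):
    # Repeatedly back up values through the known model: this is the
    # "planning with a map" step, no environment interaction needed.
    V = {s: 0.0 for s in P}
    for _ in range(iters):
        V = {s: max(sum(p * (r + gamma * V[s2]) for p, s2, r in outcomes)
                    for outcomes in P[s].values())
             for s in P}
    return V

V = value_iteration(P, gamma)
# Greedy policy extracted from the planned values.
policy = {s: max(P[s], key=lambda a: sum(p * (r + gamma * V[s2])
                                         for p, s2, r in P[s][a]))
          for s in P}
```

In a learned-model variant, `P` would be estimated from observed transitions before the same planning step is run.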
Specificity: the fraction of actual negatives correctly identified; also called the true negative rate.
AUC-ROC: scalar summary of the ROC curve; measures ranking ability, not calibration.
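A minimal sketch of both metrics from scratch (function names and the toy labels are my own): specificity counts how many actual negatives were predicted negative, and AUC is computed via its rank interpretation, the probability that a random positive scores above a random negative.

```python
def specificity(y_true, y_pred):
    # Fraction of actual negatives (label 0) predicted negative.
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    return tn / (tn + fp)

def auc(y_true, scores):
    # Probability a random positive outranks a random negative;
    # ties count as half a win. O(n^2), fine for illustration.
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```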
Mean squared error (MSE): average of squared residuals; a common regression objective.
Stochastic gradient descent (SGD): a gradient method using random minibatches for efficient training on large datasets.
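The two entries combine naturally: a sketch of mini-batch SGD minimizing MSE on a synthetic linear-regression problem (the data, learning rate, and batch size are arbitrary choices for illustration).

```python
import random

# Synthetic noiseless data: y = 2x + 1.
random.seed(0)
data = [(x, 2.0 * x + 1.0) for x in [i / 50 for i in range(100)]]

w, b, lr, batch_size = 0.0, 0.0, 0.1, 8
for step in range(2000):
    batch = random.sample(data, batch_size)  # random minibatch
    # Gradients of MSE = mean((w*x + b - y)^2) over the batch.
    gw = sum(2 * (w * x + b - y) * x for x, y in batch) / batch_size
    gb = sum(2 * (w * x + b - y) for x, y in batch) / batch_size
    w -= lr * gw
    b -= lr * gb
```

After training, `(w, b)` should be close to the generating parameters `(2, 1)`.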
Batch size: the number of samples per gradient update; impacts compute efficiency, generalization, and stability.
Early stopping: halting training when validation performance stops improving, to reduce overfitting.
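The early-stopping rule can be sketched as a patience counter over validation losses; here the per-epoch losses are passed in as a list, standing in for a real training loop, and the function name is my own.

```python
def train_with_early_stopping(val_losses, patience=3):
    # Returns the epoch of the best validation loss, halting once the
    # loss has failed to improve for `patience` consecutive epochs.
    best, best_epoch, bad = float("inf"), 0, 0
    for epoch, loss in enumerate(val_losses):
        if loss < best:
            best, best_epoch, bad = loss, epoch, 0
        else:
            bad += 1
            if bad >= patience:
                break
    return best_epoch
```

In practice the returned epoch is paired with a checkpoint of the model weights saved at that point.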
Neural network: a parameterized function composed of interconnected units organized in layers with nonlinear activations.
Activation functions: nonlinear functions enabling networks to approximate complex mappings; ReLU variants dominate modern deep learning.
Chunking: breaking documents into pieces for retrieval; chunk size and overlap strongly affect RAG quality.
Causal inference: a framework for reasoning about cause-effect relationships beyond correlation, often using structural assumptions and experiments.
Data governance: processes and controls for data quality, access, lineage, retention, and compliance across the AI lifecycle.
Experiment tracking: logging hyperparameters, code versions, data snapshots, and results to reproduce and compare experiments.
Reproducibility: the ability to replicate results given the same code and data; harder with distributed training and nondeterministic ops.
Latency: time from request to response; critical for real-time inference and UX.
Throughput: how many requests or tokens can be processed per unit time; affects scalability and cost.
Temperature: scales logits before sampling; higher values increase randomness and diversity, lower values increase determinism.
Top-k sampling: samples from the k highest-probability tokens to limit unlikely outputs.
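Temperature and top-k compose in one sampling step: divide logits by the temperature, keep the k largest, softmax, then draw. A minimal sketch (the function name and toy logits are my own):

```python
import math, random

def sample_top_k(logits, k=2, temperature=1.0, rng=random):
    # 1. Temperature-scale the logits.
    scaled = [l / temperature for l in logits]
    # 2. Keep only the k highest-scoring token indices.
    top = sorted(range(len(scaled)), key=lambda i: scaled[i], reverse=True)[:k]
    # 3. Softmax over the survivors (max-subtracted for stability).
    m = max(scaled[i] for i in top)
    exps = [math.exp(scaled[i] - m) for i in top]
    # 4. Sample proportionally to the renormalized probabilities.
    r, acc = rng.random() * sum(exps), 0.0
    for idx, e in zip(top, exps):
        acc += e
        if r <= acc:
            return idx
    return top[-1]
```

With `k=1` this degenerates to greedy decoding; as `temperature` approaches 0, the distribution concentrates on the argmax.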
Benchmark: a dataset plus metric suite for comparing models; can be gamed or misaligned with real-world goals.
Responsible AI: a discipline ensuring AI systems are fair, safe, transparent, privacy-preserving, and accountable throughout their lifecycle.
Speech recognition (ASR): converting audio speech into text, often using encoder-decoder or transducer architectures.
Information gain: the reduction in uncertainty achieved by observing a variable; used in decision trees and active learning.
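Information gain is parent entropy minus the weighted entropy of the children after a split, as used when choosing decision-tree features. A minimal sketch for a boolean split (function names are my own):

```python
import math

def entropy(labels):
    # Shannon entropy in bits of a list of class labels.
    if not labels:
        return 0.0
    n = len(labels)
    counts = {l: labels.count(l) for l in set(labels)}
    return -sum(c / n * math.log2(c / n) for c in counts.values())

def information_gain(labels, split_mask):
    # Entropy reduction from splitting `labels` by a boolean feature.
    left = [l for l, m in zip(labels, split_mask) if m]
    right = [l for l, m in zip(labels, split_mask) if not m]
    n = len(labels)
    return entropy(labels) - (len(left) / n * entropy(left)
                              + len(right) / n * entropy(right))
```

A split that perfectly separates the classes yields the full parent entropy as gain; an uninformative split yields zero.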
Learning theory: a theoretical framework analyzing what classes of functions can be learned, how efficiently, and with what guarantees.
KL divergence: measures how one probability distribution diverges from another.
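For discrete distributions the definition is D_KL(P‖Q) = Σᵢ pᵢ log(pᵢ/qᵢ); a direct sketch, with the convention that terms where pᵢ = 0 contribute nothing:

```python
import math

def kl_divergence(p, q):
    # D_KL(P || Q): nonnegative, zero iff P == Q, and asymmetric in P, Q.
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
```

Note the asymmetry: D_KL(P‖Q) generally differs from D_KL(Q‖P), which is why it is a divergence rather than a distance.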
Maximum likelihood estimation (MLE): estimating parameters by maximizing the likelihood of observed data.
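For a Gaussian the MLE has a closed form: the sample mean, and the variance with a 1/n (not 1/(n−1)) denominator, since that is what maximizes the likelihood. A sketch (the function name is my own):

```python
def gaussian_mle(xs):
    # Closed-form maximum-likelihood estimates for a Gaussian:
    # mean = sample mean; variance = biased sample variance (divide by n).
    n = len(xs)
    mu = sum(xs) / n
    var = sum((x - mu) ** 2 for x in xs) / n
    return mu, var
```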
Second-order optimization: optimization using curvature information; often expensive at scale.
Hessian: the matrix of second derivatives describing the local curvature of the loss.
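The curvature information both entries refer to can be made concrete with a finite-difference Hessian of a two-variable function (the function name and step size are my own choices; real second-order optimizers use autodiff or Hessian-vector products instead).

```python
def hessian_2d(f, x, y, h=1e-3):
    # Central finite-difference approximation of the 2x2 Hessian of f(x, y).
    fxx = (f(x + h, y) - 2 * f(x, y) + f(x - h, y)) / h**2
    fyy = (f(x, y + h) - 2 * f(x, y) + f(x, y - h)) / h**2
    fxy = (f(x + h, y + h) - f(x + h, y - h)
           - f(x - h, y + h) + f(x - h, y - h)) / (4 * h**2)
    return [[fxx, fxy], [fxy, fyy]]  # symmetric for smooth f
```

For a quadratic such as f(x, y) = x² + 3xy + 2y² the Hessian is the constant matrix [[2, 3], [3, 4]], which the approximation recovers.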
Bottleneck layer: a narrow hidden layer forcing compact representations.
Depth vs. width: tradeoffs between many layers and many neurons per layer.
Self-refinement: models evaluating and improving their own outputs.
Gradient inversion: recovering training data from gradients.