Results for "model-based"
Model-Based RL
Advanced: RL using learned or known environment models.
Model-based reinforcement learning is like having a map while exploring a new city. Instead of wandering around aimlessly, you can look at the map to plan your route and make better decisions about where to go next. In this type of learning, an AI agent first learns how the environment works—like...
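The "map" idea in the excerpt can be sketched in a few lines. This is a minimal, illustrative example (the toy world, function names, and exhaustive-lookahead planner are assumptions, not from the entry): the agent has a dynamics model that predicts next state and reward, and it plans by rolling candidate action sequences forward inside the model, then executing the best first action.

```python
import itertools

# Toy dynamics model ("the map"): predicts next state and reward.
# Illustrative 1-D world: positions on a line, goal at position 5.
def model(state, action):
    next_state = state + action              # action is -1 or +1
    reward = 1.0 if next_state == 5 else 0.0
    return next_state, reward

# Simulate an action sequence inside the model, never the real world.
def imagined_return(state, actions):
    total = 0.0
    for a in actions:
        state, r = model(state, a)
        total += r
    return total

# Plan by exhaustive lookahead over all 2^horizon action sequences,
# returning the first action of the best imagined trajectory.
def plan(state, horizon=5):
    best = max(
        itertools.product([-1, 1], repeat=horizon),
        key=lambda seq: imagined_return(state, seq),
    )
    return best[0]

print(plan(0))  # from position 0, the planner's first move is +1, toward the goal
```

Real systems replace the exhaustive search with sampled or gradient-based planners (e.g. random shooting or MPC), but the structure is the same: decisions come from simulated futures, not trial and error in the real environment.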
Coordinating models, tools, and logic.
Optimizes future actions using a model of dynamics.
Mathematical representation of friction forces.
Updating a pretrained model’s weights on task-specific data to improve performance or adapt style/behavior.
The relationship between inputs and outputs changes over time, requiring monitoring and model updates.
How well a model performs on new data drawn from the same (or a similar) distribution as the training data.
Fraction of correct predictions; can be misleading on imbalanced datasets.
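The pitfall in that definition is easy to demonstrate: on a heavily imbalanced dataset, a degenerate classifier can score high accuracy while being useless. A small self-contained sketch (the labels are made up for illustration):

```python
# 95 negatives, 5 positives: a classifier that always predicts the
# majority class scores 95% accuracy yet detects no positives at all.
y_true = [0] * 95 + [1] * 5
y_pred = [0] * 100                 # always predict "negative"

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
recall = sum(t == p == 1 for t, p in zip(y_true, y_pred)) / sum(y_true)

print(accuracy)  # → 0.95
print(recall)    # → 0.0
```

This is why metrics that focus on the positive class (precision, recall, PR curves) are preferred when classes are imbalanced.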
Often more informative than ROC on imbalanced datasets; focuses on positive class performance.
Generates sequences one token at a time, conditioning on past tokens.
Stepwise reasoning patterns that can improve multi-step tasks; often handled implicitly or summarized for safety/privacy.
Techniques that fine-tune small additional components rather than all weights to reduce compute and storage.
Systematic review of model/data processes to ensure performance, fairness, security, and policy compliance.
Exponential of average negative log-likelihood; lower means better predictive fit, not necessarily better utility.
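The formula in that definition is short enough to compute directly. A minimal sketch (the function name and inputs are illustrative): given the probability the model assigned to each observed token, perplexity is the exponential of the mean negative log-likelihood.

```python
import math

# perplexity = exp(mean negative log-likelihood of the observed tokens)
def perplexity(token_probs):
    nll = [-math.log(p) for p in token_probs]
    return math.exp(sum(nll) / len(nll))

# A model spreading probability uniformly over 4 choices has
# perplexity 4: it is "as confused as" a fair 4-sided die.
print(round(perplexity([0.25, 0.25, 0.25, 0.25]), 6))  # → 4.0
```

Lower is better, but as the entry notes, a lower perplexity does not guarantee better downstream utility.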
Maliciously inserting or altering training data to implant backdoors or degrade performance.
System design where humans validate or guide model outputs, especially for high-stakes decisions.
Using the same parameters across different parts of a model.
Allows the model to attend to information from different subspaces simultaneously.
Logged record of model inputs, outputs, and decisions.
Probabilistic graphical model for structured prediction.
Controls amount of noise added at each diffusion step.
Formal model linking causal mechanisms and variables.
Running predictions on large datasets periodically.
Incrementally deploying new models to reduce risk.
Increasing model capacity via compute.
Increasing performance via more data.
Cost to run models in production.
Learned subsystem that optimizes its own objective.
Breaking tasks into sub-steps.
Prompt augmented with retrieved documents.
Small prompt changes cause large output changes.