Search: cumulative probability

Sampling Intermediate

Stochastic generation strategies that trade determinism for diversity; key knobs include temperature and nucleus sampling.

Foundations & Theory

Probability Distribution Advanced

Describes likelihoods of random variable outcomes.

Probability & Statistics

Reinforcement Learning Intermediate

A learning paradigm where an agent interacts with an environment and learns to choose actions to maximize cumulative reward.

Reinforcement Learning

Value Function Intermediate

Expected cumulative reward from a state or state-action pair.

AI Economics & Strategy

Top-p Intermediate

Samples from the smallest set of tokens whose probabilities sum to p, adapting set size by context.

Foundations & Theory

Top-k Intermediate

Samples from the k highest-probability tokens to limit unlikely outputs.

Foundations & Theory

Likelihood Function Advanced

Probability of data given parameters.

Probability & Statistics

Policy Intermediate

Strategy mapping states to actions.

AI Economics & Strategy

Markov Decision Process Intermediate

Formal framework for sequential decision-making under uncertainty.

AI Economics & Strategy

Change Point Detection Intermediate

Identifying abrupt changes in data generation.

Time Series

Q-Function Intermediate

Expected return of taking action in a state.

AI Economics & Strategy

KL Divergence Intermediate

Measures how one probability distribution diverges from another.

AI Economics & Strategy

Energy-Based Model Intermediate

Models that define an energy landscape rather than explicit probabilities.

Model Architectures

RLHF Intermediate

Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.

Optimization

Beam Search Intermediate

Search algorithm for generation that keeps top-k partial sequences; can improve likelihood but reduce diversity.

Foundations & Theory

Action Space Intermediate

Set of all actions available to the agent.

AI Economics & Strategy

Optimal Control Intermediate

Finding control policies minimizing cumulative cost.

Foundations & Theory

Slow Takeoff Advanced

Incremental capability growth.

AI Safety & Alignment

Log Loss Intermediate

Penalizes confident wrong predictions heavily; standard for classification and language modeling.

Optimization

Maximum Likelihood Estimation Intermediate

Estimating parameters by maximizing likelihood of observed data.

AI Economics & Strategy

Bayesian Inference Intermediate

Updating beliefs about parameters using observed evidence and prior distributions.

AI Economics & Strategy

Factor Graph Intermediate

Graphical model expressing factorization of a probability distribution.

Model Architectures

Expectation Advanced

Average value under a distribution.

Probability & Statistics

Language Model Intermediate

A model that assigns probabilities to sequences of tokens; often trained by next-token prediction.

Large Language Models

PAC Learning Intermediate

A model is PAC-learnable if it can, with high probability, learn an approximately correct hypothesis from finite samples.

AI Economics & Strategy

Entropy Intermediate

A measure of randomness or uncertainty in a probability distribution.

AI Economics & Strategy

Cross-Entropy Intermediate

Measures divergence between true and predicted probability distributions.

AI Economics & Strategy

Fisher Information Intermediate

Measures how much information an observable random variable carries about unknown parameters.

AI Economics & Strategy

Propensity Score Advanced

Probability of treatment assignment given covariates.

Causal AI & Interpretability

GAN Advanced

Two-network setup where generator fools a discriminator.

Diffusion & Generative Models

Results for "cumulative probability"

Welcome to AI Glossary

Search

Browse

3D WordGraph