Search: function approximation

Reward Shaping Advanced

Modifying reward to accelerate learning.

Reinforcement Learning

Reinforcement Learning Intermediate

A learning paradigm where an agent interacts with an environment and learns to choose actions to maximize cumulative reward.

Reinforcement Learning

ReLU Intermediate

Activation max(0, x); improves gradient flow and training speed in deep nets.

Foundations & Theory

Logits Intermediate

Raw model outputs before converting to probabilities; manipulated during decoding and calibration.

Foundations & Theory

Convex Optimization Intermediate

Optimization problems where any local minimum is global.

AI Economics & Strategy

Maximum Likelihood Estimation Intermediate

Estimating parameters by maximizing likelihood of observed data.

AI Economics & Strategy

Sharp Minimum Intermediate

A narrow minimum often associated with poorer generalization.

AI Economics & Strategy

Markov Decision Process Intermediate

Formal framework for sequential decision-making under uncertainty.

AI Economics & Strategy

Energy-Based Model Intermediate

Models that define an energy landscape rather than explicit probabilities.

Model Architectures

Policy Gradient Intermediate

Optimizing policies directly via gradient ascent on expected reward.

AI Economics & Strategy

Expectation Advanced

Average value under a distribution.

Probability & Statistics

Local Minimum Intermediate

Minimum relative to nearby points.

Foundations & Theory

Constrained Optimization Intermediate

Optimization under equality/inequality constraints.

Foundations & Theory

Lagrangian Intermediate

Converts constrained problem to unconstrained form.

Foundations & Theory

Dual Problem Intermediate

Alternative formulation providing bounds.

Foundations & Theory

Optimal Control Intermediate

Finding control policies minimizing cumulative cost.

Foundations & Theory

Model Predictive Control Intermediate

Optimizes future actions using a model of dynamics.

Foundations & Theory

Linear Quadratic Regulator Intermediate

Optimal control for linear systems with quadratic cost.

Foundations & Theory

Imitation Learning Advanced

Learning policies from expert demonstrations.

Reinforcement Learning

Online Learning Intermediate

Learning where data arrives sequentially and the model updates continuously, often under changing distributions.

Machine Learning

Transfer Learning Intermediate

Reusing knowledge from a source task/domain to improve learning on a target task/domain, typically via pretrained models.

Machine Learning

Multitask Learning Intermediate

Training one model on multiple tasks simultaneously to improve generalization through shared structure.

Machine Learning

Embedding Intermediate

A continuous vector encoding of an item (word, image, user) such that semantic similarity corresponds to geometric closeness.

Machine Learning

Feature Engineering Intermediate

Designing input features to expose useful structure (e.g., ratios, lags, aggregations), often crucial outside deep learning.

Foundations & Theory

Hyperparameters Intermediate

Configuration choices not learned directly (or not typically learned) that govern training or architecture.

Optimization

Representation Learning Intermediate

Automatically learning useful internal features (latent variables) that capture salient structure for downstream tasks.

Machine Learning

Empirical Risk Minimization Intermediate

Minimizing average loss on training data; can overfit when data is limited or biased.

Optimization

Regularization Intermediate

Techniques that discourage overly complex solutions to improve generalization (reduce overfitting).

Foundations & Theory

Overfitting Intermediate

When a model fits noise/idiosyncrasies of training data and performs poorly on unseen data.

Foundations & Theory

Mean Squared Error Intermediate

Average of squared residuals; common regression objective.

Optimization

Results for "function approximation"

Welcome to AI Glossary

Search

Browse

3D WordGraph