Search: learning signal

Weight Initialization Intermediate

Methods to set starting weights to preserve signal/gradient scales across layers.

Foundations & Theory

Supervised Learning Intermediate

Learning a function from input-output pairs (labeled data), optimizing performance on predicting outputs for unseen inputs.

Machine Learning

Unsupervised Learning Intermediate

Learning structure from unlabeled data, such as discovering groups, compressing representations, or modeling data distributions.

Machine Learning

Self-Supervised Learning Intermediate

Learning from data by constructing “pseudo-labels” (e.g., next-token prediction, masked modeling) without manual annotation.

Machine Learning

Reinforcement Learning Intermediate

A learning paradigm where an agent interacts with an environment and learns to choose actions to maximize cumulative reward.

Reinforcement Learning

Online Learning Intermediate

Learning where data arrives sequentially and the model updates continuously, often under changing distributions.

Machine Learning

Transfer Learning Intermediate

Reusing knowledge from a source task/domain to improve learning on a target task/domain, typically via pretrained models.

Machine Learning

Representation Learning Intermediate

Automatically learning useful internal features (latent variables) that capture salient structure for downstream tasks.

Machine Learning

Learning Rate Schedule Intermediate

Adjusting learning rate over training to improve convergence.

AI Economics & Strategy

Off-Policy Learning Intermediate

Learning from data generated by a different policy.

AI Economics & Strategy

On-Policy Learning Intermediate

Learning only from current policy’s data.

AI Economics & Strategy

Imitation Learning Advanced

Learning policies from expert demonstrations.

Reinforcement Learning

Lifelong Learning Advanced

Learning without catastrophic forgetting.

Agents & Autonomy

Feature Engineering Intermediate

Designing input features to expose useful structure (e.g., ratios, lags, aggregations), often crucial outside deep learning.

Foundations & Theory

Vanishing Gradient Intermediate

Gradients shrink through layers, slowing learning in early layers; mitigated by ReLU, residuals, normalization.

Foundations & Theory

RLHF Intermediate

Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.

Optimization

Bias Term Intermediate

Systematic error introduced by simplifying assumptions in a learning algorithm.

AI Economics & Strategy

Information Gain Intermediate

Reduction in uncertainty achieved by observing a variable; used in decision trees and active learning.

AI Economics & Strategy

Warmup Intermediate

Gradually increasing learning rate at training start to avoid divergence.

AI Economics & Strategy

Inductive Bias Intermediate

Built-in assumptions guiding learning efficiency and generalization.

AI Economics & Strategy

Actor-Critic Intermediate

Combines value estimation (critic) with policy learning (actor).

AI Economics & Strategy

Exploration-Exploitation Tradeoff Intermediate

Balancing learning new behaviors vs exploiting known rewards.

AI Economics & Strategy

Adaptive Optimization Intermediate

Methods like Adam adjusting learning rates dynamically.

Foundations & Theory

Catastrophic Forgetting Intermediate

Loss of old knowledge when learning new tasks.

Model Failure Modes

System Identification Advanced

Learning physical parameters from data.

Simulation & Sim-to-Real

Reward Shaping Advanced

Modifying reward to accelerate learning.

Reinforcement Learning

Behavior Cloning Advanced

Learning action mapping directly from demonstrations.

Reinforcement Learning

Predictive Coding Frontier

Learning by minimizing prediction error.

World Models & Cognition

Developmental Robotics Advanced

Robots learning via exploration and growth.

Agents & Autonomy

AlphaFold Advanced

Deep learning system for protein structure prediction.

AI in Science

Results for "learning signal"