Results for "learning signal"

44 results

Weight Initialization Intermediate

Methods to set starting weights to preserve signal/gradient scales across layers.

Foundations & Theory
Supervised Learning Intermediate

Learning a function from input-output pairs (labeled data), optimizing performance on predicting outputs for unseen inputs.

Machine Learning
Unsupervised Learning Intermediate

Learning structure from unlabeled data, such as discovering groups, compressing representations, or modeling data distributions.

Machine Learning
Self-Supervised Learning Intermediate

Learning from data by constructing “pseudo-labels” (e.g., next-token prediction, masked modeling) without manual annotation.

Machine Learning
Reinforcement Learning Intermediate

A learning paradigm where an agent interacts with an environment and learns to choose actions to maximize cumulative reward.

Reinforcement Learning
Online Learning Intermediate

Learning where data arrives sequentially and the model updates continuously, often under changing distributions.

Machine Learning
Transfer Learning Intermediate

Reusing knowledge from a source task/domain to improve learning on a target task/domain, typically via pretrained models.

Machine Learning
Representation Learning Intermediate

Automatically learning useful internal features (latent variables) that capture salient structure for downstream tasks.

Machine Learning
Learning Rate Schedule Intermediate

Adjusting learning rate over training to improve convergence.

AI Economics & Strategy
Off-Policy Learning Intermediate

Learning from data generated by a different policy.

AI Economics & Strategy
On-Policy Learning Intermediate

Learning only from current policy’s data.

AI Economics & Strategy
Imitation Learning Advanced

Learning policies from expert demonstrations.

Reinforcement Learning
Lifelong Learning Advanced

Learning without catastrophic forgetting.

Agents & Autonomy
Feature Engineering Intermediate

Designing input features to expose useful structure (e.g., ratios, lags, aggregations), often crucial outside deep learning.

Foundations & Theory
Vanishing Gradient Intermediate

Gradients shrink through layers, slowing learning in early layers; mitigated by ReLU, residuals, normalization.

Foundations & Theory
RLHF Intermediate

Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.

Optimization
Bias Term Intermediate

Systematic error introduced by simplifying assumptions in a learning algorithm.

AI Economics & Strategy
Information Gain Intermediate

Reduction in uncertainty achieved by observing a variable; used in decision trees and active learning.

AI Economics & Strategy
Warmup Intermediate

Gradually increasing learning rate at training start to avoid divergence.

AI Economics & Strategy
Inductive Bias Intermediate

Built-in assumptions guiding learning efficiency and generalization.

AI Economics & Strategy
Actor-Critic Intermediate

Combines value estimation (critic) with policy learning (actor).

AI Economics & Strategy
Exploration-Exploitation Tradeoff Intermediate

Balancing learning new behaviors vs exploiting known rewards.

AI Economics & Strategy
Adaptive Optimization Intermediate

Methods like Adam adjusting learning rates dynamically.

Foundations & Theory
Catastrophic Forgetting Intermediate

Loss of old knowledge when learning new tasks.

Model Failure Modes
System Identification Advanced

Learning physical parameters from data.

Simulation & Sim-to-Real
Reward Shaping Advanced

Modifying reward to accelerate learning.

Reinforcement Learning
Behavior Cloning Advanced

Learning action mapping directly from demonstrations.

Reinforcement Learning
Predictive Coding Frontier

Learning by minimizing prediction error.

World Models & Cognition
Developmental Robotics Advanced

Robots learning via exploration and growth.

Agents & Autonomy
AlphaFold Advanced

Deep learning system for protein structure prediction.

AI in Science