Results for "learning rate"

Learning Rate

Intermediate

Controls the size of parameter updates: too high and training diverges; too low and it converges slowly or gets stuck.

Think of the learning rate as the size of your steps when walking toward a destination. If you take giant steps, you might overshoot and miss your goal, but if you take tiny steps, you might take forever to get there. In machine learning, the learning rate controls how big a change we make to...
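The step-size analogy can be sketched in a few lines of Python. This is a minimal illustration, not part of the glossary entry: the objective f(x) = x**2, the function name gradient_descent, and the three learning-rate values are all assumptions chosen to make the effect visible.

```python
def gradient_descent(lr, steps=50, x0=1.0):
    """Minimize the toy objective f(x) = x**2 with plain gradient descent.

    Hypothetical helper for illustration; returns the final value of x.
    The minimum of f is at x = 0.
    """
    x = x0
    for _ in range(steps):
        grad = 2.0 * x       # derivative of x**2
        x = x - lr * grad    # the learning rate scales each update
    return x


# Assumed learning rates, picked only to contrast the three regimes.
too_small = gradient_descent(lr=0.001)   # barely moves away from x0
reasonable = gradient_descent(lr=0.1)    # approaches the minimum at 0
too_large = gradient_descent(lr=1.1)     # overshoots more each step and diverges

print(f"lr=0.001 -> x = {too_small:.4f}")
print(f"lr=0.1   -> x = {reasonable:.6f}")
print(f"lr=1.1   -> x = {too_large:.1f}")
```

On this toy problem each update multiplies x by (1 - 2 * lr), so any learning rate above 1.0 makes the magnitude of x grow rather than shrink. Real models have no such simple threshold, which is why the learning rate is usually tuned empirically.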


348 results

Actor-Critic Intermediate

Combines value estimation (critic) with policy learning (actor).

AI Economics & Strategy

Exploration-Exploitation Tradeoff Intermediate

Balancing exploration of new behaviors against exploitation of known rewards.

AI Economics & Strategy

Agent Loop Intermediate

Continuous cycle of observation, reasoning, action, and feedback.

AI Economics & Strategy

Catastrophic Forgetting Intermediate

Loss of old knowledge when learning new tasks.

Model Failure Modes

Hybrid Training Advanced

Combining simulation and real-world data.

Simulation & Sim-to-Real

Model-Free RL Advanced

RL without an explicit dynamics model.

Reinforcement Learning

World Model Frontier

Learned model of environment dynamics.

World Models & Cognition

Lifelong Learning Advanced

Learning without catastrophic forgetting.

Agents & Autonomy

AlphaFold Advanced

Deep learning system for protein structure prediction.

Narrow AI Frontier

AI limited to specific domains.

AGI & General Intelligence

Feature Intermediate

A measurable property or attribute used as model input (raw or engineered), such as age, pixel intensity, or token ID.

Foundations & Theory

Loss Function Intermediate

A function measuring prediction error (and sometimes calibration), guiding gradient-based optimization.

Foundations & Theory

MLOps Intermediate

Practices for operationalizing ML: versioning, CI/CD, monitoring, retraining, and reliable production management.

MLOps & Infrastructure

CI/CD for ML Intermediate

Automated testing and deployment processes for models and data workflows, extending DevOps to ML artifacts.

MLOps & Infrastructure

Backdoor / Trojan Intermediate

Hidden behavior activated by specific triggers, causing targeted mispredictions or undesired outputs.

Foundations & Theory

Model Stealing Intermediate

Reconstructing a model or its capabilities via API queries or leaked artifacts.

Foundations & Theory

Information Gain Intermediate

Reduction in uncertainty achieved by observing a variable; used in decision trees and active learning.

AI Economics & Strategy

State Space Intermediate

All possible configurations an agent may encounter.

AI Economics & Strategy

Q-Function Intermediate

Expected return from taking a given action in a given state.

AI Economics & Strategy

Self-Reflection Intermediate

Models evaluating and improving their own outputs.

AI Economics & Strategy

Boltzmann Machine Intermediate

Probabilistic energy-based neural network with hidden variables.

Model Architectures

Restricted Boltzmann Machine Intermediate

Simplified Boltzmann Machine with bipartite structure.

Model Architectures

Data Scaling Intermediate

Increasing performance via more data.

AI Economics & Strategy

Objective Surface Intermediate

Visualization of optimization landscape.

Foundations & Theory

Saddle Plateau Intermediate

Flat high-dimensional regions slowing training.

Foundations & Theory

Stochastic Approximation Intermediate

Iterative optimization from noisy observations of the objective or its gradient.

Foundations & Theory

Feedback Loop Collapse Intermediate

A model trained on its own outputs degrades in quality.

Model Failure Modes

Model-Based RL Advanced

RL using learned or known environment models.

Reinforcement Learning

Dynamics Model Advanced

Predicts next state given current state and action.

Reinforcement Learning

Behavior Cloning Advanced

Learning action mapping directly from demonstrations.

Reinforcement Learning
