Results for "noisy gradients"
State Estimation
Advanced
Inferring the agent’s internal state from noisy sensor data.
Momentum
Intermediate
Uses an exponential moving average of gradients to speed convergence and reduce oscillation.
Vanishing Gradient
Intermediate
Gradients shrink through layers, slowing learning in early layers; mitigated by ReLU, residuals, normalization.
Exploding Gradient
Intermediate
Gradients grow too large, causing divergence; mitigated by clipping, normalization, careful init.
LSTM
Intermediate
An RNN variant using gates to mitigate vanishing gradients and capture longer context.
Gradient Clipping
Intermediate
Limiting gradient magnitude to prevent exploding gradients.
Residual Connection
Intermediate
Allows gradients to bypass layers, enabling very deep networks.
Gradient Leakage
Intermediate
Recovering training data from gradients.