Vanishing Gradient

Intermediate

Gradients shrink through layers, slowing learning in early layers; mitigated by ReLU, residuals, normalization.

Full Definition

Gradients shrink through layers, slowing learning in early layers; mitigated by ReLU, residuals, normalization.

Keywords

Domains

Related Terms

Concept Map

See how Vanishing Gradient connects to other concepts.

Open Knowledge Graph