Vanishing Gradient
IntermediateGradients shrink through layers, slowing learning in early layers; mitigated by ReLU, residuals, normalization.
Full Definition
Gradients shrink through layers, slowing learning in early layers; mitigated by ReLU, residuals, normalization.
Keywords
Domains
Related Terms
Concept Map
See how Vanishing Gradient connects to other concepts.
Open Knowledge Graph