Results for "weight reuse"

AdvertisementAd space — search-top

18 results

Feature Store Intermediate

Centralized repository for curated features.

MLOps & Infrastructure
Weight Initialization Intermediate

Methods to set starting weights to preserve signal/gradient scales across layers.

Foundations & Theory
Open-Weight Model Intermediate

Models whose weights are publicly available.

AI Economics & Strategy
Exploding Gradient Intermediate

Gradients grow too large, causing divergence; mitigated by clipping, normalization, careful init.

Foundations & Theory
Convolutional Neural Network Intermediate

Networks using convolution operations with weight sharing and locality, effective for images and signals.

Neural Networks Computer Vision
Few-Shot Learning Intermediate

Achieving task performance by providing a small number of examples inside the prompt without weight updates.

Foundations & Theory
Vanishing Gradient Intermediate

Gradients shrink through layers, slowing learning in early layers; mitigated by ReLU, residuals, normalization.

Foundations & Theory
LoRA Intermediate

PEFT method injecting trainable low-rank matrices into layers, enabling efficient fine-tuning.

Foundations & Theory
Recurrent Neural Network Intermediate

Networks with recurrent connections for sequences; largely supplanted by Transformers for many tasks.

Neural Networks
Logits Intermediate

Raw model outputs before converting to probabilities; manipulated during decoding and calibration.

Foundations & Theory
Pruning Intermediate

Removing weights or neurons to shrink models and improve efficiency; can be structured or unstructured.

Foundations & Theory
Bottleneck Layer Intermediate

A narrow hidden layer forcing compact representations.

AI Economics & Strategy
Parameter Sharing Intermediate

Using same parameters across different parts of a model.

AI Economics & Strategy
Closed Model Intermediate

Models accessible only via service APIs.

AI Economics & Strategy
Graph Attention Network Intermediate

GNN using attention to weight neighbor contributions dynamically.

Model Architectures
Orthogonality Advanced

Vectors with zero inner product; implies independence.

Mathematics
Catastrophic Forgetting Intermediate

Loss of old knowledge when learning new tasks.

Model Failure Modes
Lifelong Learning Advanced

Learning without catastrophic forgetting.

Agents & Autonomy

Welcome to AI Glossary

The free, self-building AI dictionary. Help us keep it free—click an ad once in a while!

Search

Type any question or keyword into the search bar at the top.

Browse

Tap a letter in the A–Z bar to browse terms alphabetically, or filter by domain, industry, or difficulty level.

3D WordGraph

Fly around the interactive 3D graph to explore how AI concepts connect. Click any word to read its full definition.