Results for "generalization"
Generalization
Intermediate
How well a model performs on new data drawn from the same (or similar) distribution as training.
Generalization is like a student who learns a subject well enough to answer different types of questions on a test, not just the ones they practiced. In machine learning, generalization means that a model can make accurate predictions on new data it hasn’t seen before. A model that generalizes well performs reliably beyond the specific examples it was trained on.
A narrow minimum often associated with poorer generalization.
Number of samples per gradient update; impacts compute efficiency, generalization, and stability.
A wide basin often correlated with better generalization.
Training one model on multiple tasks simultaneously to improve generalization through shared structure.
Ordering training samples from easier to harder to improve convergence or generalization.
Built-in assumptions guiding learning efficiency and generalization.
A measure of a model class’s expressive capacity based on its ability to shatter datasets.
Measures a model’s ability to fit random noise; used to bound generalization error.
Minimizing average loss on training data; can overfit when data is limited or biased.
Techniques that discourage overly complex solutions to improve generalization (reduce overfitting).
Randomly zeroing activations during training to reduce co-adaptation and overfitting.
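The dropout entry above can be sketched in a few lines of NumPy. This is an illustrative implementation of "inverted" dropout (the function name and rate are hypothetical, not from this glossary):

```python
import numpy as np

def dropout(activations, p=0.5, rng=None):
    """Inverted dropout: zero each activation with probability p,
    scaling survivors by 1/(1-p) so expected activations are unchanged."""
    rng = rng or np.random.default_rng()
    mask = rng.random(activations.shape) >= p  # keep with probability 1-p
    return activations * mask / (1.0 - p)

x = np.ones((2, 4))
y = dropout(x, p=0.5, rng=np.random.default_rng(0))  # entries are 0.0 or 2.0
```

The rescaling means no change is needed at test time: dropout is simply disabled.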
When information from evaluation data improperly influences training, inflating reported performance.
The set of tokens a model can represent; impacts efficiency, multilinguality, and handling of rare strings.
A high-capacity language model trained on massive corpora, exhibiting broad generalization and emergent behaviors.
Expanding training data via transformations (flips, noise, paraphrases) to improve robustness.
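A minimal sketch of the augmentation entry above, assuming a 2-D image array (the transform choices and noise scale are illustrative):

```python
import numpy as np

def augment(image, rng):
    """Return a randomly flipped, noise-perturbed copy of an image array."""
    out = image.copy()
    if rng.random() < 0.5:
        out = out[:, ::-1]                            # horizontal flip
    out = out + rng.normal(0.0, 0.05, out.shape)      # small Gaussian noise
    return out

rng = np.random.default_rng(1)
batch = [augment(np.zeros((4, 4)), rng) for _ in range(8)]  # 8 varied copies
```

Each call yields a slightly different training example from the same source image, which is the sense in which augmentation "expands" the dataset.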
Artificially created data used to train/test models; helpful for privacy and coverage, risky if unrealistic.
Automated testing and deployment processes for models and data workflows, extending DevOps to ML artifacts.
A theoretical framework analyzing what classes of functions can be learned, how efficiently, and with what guarantees.
Error due to sensitivity to fluctuations in the training dataset.
Using the same parameters across different parts of a model.
The range of functions a model can represent.
Encodes positional information via rotation in embedding space.
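One common formulation of the rotary-embedding entry above, sketched in NumPy (this version pairs the first and second halves of the channel dimension; real implementations vary in how pairs are laid out):

```python
import numpy as np

def rope(x, positions, base=10000.0):
    """Rotate channel pairs of x by position-dependent angles.
    x: (seq_len, d) with d even; positions: (seq_len,) integer positions."""
    half = x.shape[-1] // 2
    freqs = base ** (-np.arange(half) / half)      # per-pair rotation frequency
    angles = positions[:, None] * freqs[None, :]   # (seq_len, half)
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = x[..., :half], x[..., half:]
    # standard 2-D rotation applied to each (x1, x2) channel pair
    return np.concatenate([x1 * cos - x2 * sin,
                           x1 * sin + x2 * cos], axis=-1)
```

Because each pair is rotated, vector norms are preserved, and the inner product between two rotated vectors depends only on their relative positions.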
Improving model performance by training on more data.
Measure of spread around the mean.
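For reference, the entry above corresponds to the population variance, computable in a few lines of plain Python (hypothetical helper name):

```python
def variance(xs):
    """Population variance: average squared deviation from the mean."""
    mean = sum(xs) / len(xs)
    return sum((x - mean) ** 2 for x in xs) / len(xs)

variance([2, 4, 4, 4, 5, 5, 7, 9])  # mean is 5; variance is 4.0
```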
Ensuring learned behavior matches intended objective.
Applying learned patterns in contexts where they no longer hold.
Loss of old knowledge when learning new tasks.
Train/test environment mismatch.
Randomizing simulation parameters to improve real-world transfer.