Results for "loss geometry"
Generalization: How well a model performs on new data drawn from the same (or similar) distribution as the training data.
Calibration: The degree to which predicted probabilities match observed frequencies (e.g., predictions made with confidence 0.8 should be correct ~80% of the time).
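Calibration is often measured with expected calibration error (ECE): bin predictions by confidence and compare each bin's average confidence to its accuracy. A minimal sketch, assuming a toy dataset and a simple equal-width binning scheme (the function name and data are illustrative):

```python
# Sketch: expected calibration error (ECE) on toy predictions.
# Bin predicted confidences, then compare average confidence
# to empirical accuracy within each bin.

def expected_calibration_error(confidences, correct, n_bins=5):
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        idx = min(int(conf * n_bins), n_bins - 1)  # equal-width confidence bins
        bins[idx].append((conf, ok))
    ece, total = 0.0, len(confidences)
    for b in bins:
        if not b:
            continue
        avg_conf = sum(c for c, _ in b) / len(b)
        accuracy = sum(o for _, o in b) / len(b)
        ece += (len(b) / total) * abs(avg_conf - accuracy)
    return ece

# Perfectly calibrated toy case: 0.8-confidence predictions, 4 of 5 correct.
confs = [0.8] * 5
correct = [1, 1, 1, 1, 0]
print(round(expected_calibration_error(confs, correct), 3))  # 0.0
```

A lower ECE means the model's stated confidences track its actual hit rate.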
Learning rate: Controls the size of parameter updates; too high and training diverges, too low and training is slow or gets stuck.
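The divergence/convergence trade-off is visible even on a one-dimensional quadratic. A minimal sketch, assuming f(x) = x² with gradient 2x (the step counts and rates are illustrative):

```python
# Sketch: effect of learning rate on gradient descent for f(x) = x**2,
# whose gradient is 2*x. Each step: x <- x - lr * f'(x).

def gradient_descent(lr, x0=1.0, steps=20):
    x = x0
    for _ in range(steps):
        x -= lr * 2 * x  # gradient step
    return x

print(abs(gradient_descent(0.1)))  # small: converges toward the minimum at 0
print(abs(gradient_descent(1.1)))  # large: |x| grows each step (divergence)
```

For this function any lr below 1.0 converges; above 1.0 each step overshoots and the iterates explode.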
Epoch: One complete pass over the training dataset during training.
Neural network: A parameterized function composed of interconnected units organized in layers with nonlinear activations.
Next-token prediction: Training objective where the model predicts the next token given the previous tokens (causal language modeling).
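The objective is the cross-entropy between the model's next-token distribution and the actual next token, i.e. targets are the inputs shifted left by one position. A minimal sketch, where the "model" is a fixed uniform distribution over a 4-token vocabulary (an assumption for brevity, standing in for a real network):

```python
import math

# Sketch: causal LM loss. At each position t, score the probability the
# model assigns to tokens[t + 1]; average the negative log-probabilities.

VOCAB = 4  # toy vocabulary size

def next_token_loss(tokens):
    losses = []
    for t in range(len(tokens) - 1):
        # Stand-in for model(tokens[:t + 1])[tokens[t + 1]]:
        prob_of_target = 1.0 / VOCAB
        losses.append(-math.log(prob_of_target))
    return sum(losses) / len(losses)

print(next_token_loss([2, 0, 3, 1]))  # log(4) ≈ 1.386 per position
```

A real model replaces the uniform stand-in with learned conditional probabilities, so the loss falls below log(vocab size) as training progresses.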
Vanishing gradients: Gradients shrink as they propagate backward through layers, slowing learning in early layers; mitigated by ReLU activations, residual connections, and normalization.
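The shrinkage is geometric in depth because backpropagation multiplies per-layer derivatives. A minimal sketch, assuming a chain of sigmoid layers (whose derivative never exceeds 0.25) and an illustrative depth of 20:

```python
# Sketch: why gradients vanish with depth. Backprop multiplies per-layer
# derivatives; sigmoid's derivative peaks at 0.25, so even in the best
# case the gradient shrinks by at least 4x per sigmoid layer.

MAX_SIGMOID_DERIV = 0.25

grad = 1.0
for layer in range(20):
    grad *= MAX_SIGMOID_DERIV  # best-case factor per sigmoid layer
print(grad)  # 0.25**20 ≈ 9.1e-13: early layers receive almost no signal
```

ReLU (derivative 1 on the active region) and residual connections (an additive identity path) break exactly this multiplicative chain.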
Masked language modeling: Predicts masked tokens in a sequence, enabling bidirectional context; often used for embeddings rather than generation.
Context window: The maximum number of tokens the model can attend to in one forward pass; constrains long-document reasoning.
Few-shot learning: Achieving task performance by providing a small number of examples inside the prompt, without weight updates.
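In practice this means packing labeled examples into the prompt so the model infers the task from the pattern. A minimal sketch, where the task, examples, and prompt format are all illustrative choices:

```python
# Sketch: building a few-shot prompt for a toy sentiment task.
# The labeled examples teach the format; the final line is left
# unlabeled for the model to complete. No weights are updated.

examples = [("great movie!", "positive"), ("waste of time", "negative")]
query = "an instant classic"

prompt = "".join(f"Review: {text}\nSentiment: {label}\n\n"
                 for text, label in examples)
prompt += f"Review: {query}\nSentiment:"
print(prompt)
```

The trailing "Sentiment:" cue is what invites the model to continue the established pattern with a label for the query.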
Instruction tuning (supervised fine-tuning): Fine-tuning on (prompt, response) pairs to align a model with instruction-following behavior.
Direct Preference Optimization (DPO): A preference-based training method that optimizes the policy directly from pairwise comparisons, without an explicit RL loop.
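For a single preference pair, the DPO loss is the negative log-sigmoid of a scaled margin between how much the policy (relative to a frozen reference model) prefers the chosen response over the rejected one. A minimal sketch with made-up log-probabilities (the numbers and beta value are illustrative):

```python
import math

# Sketch of the per-pair DPO objective:
#   loss = -log sigmoid(beta * ((logp_w - ref_logp_w) - (logp_l - ref_logp_l)))
# where w is the preferred ("chosen") response and l the rejected one,
# logp_* are policy log-probs and ref_logp_* are reference-model log-probs.

def dpo_loss(logp_w, logp_l, ref_logp_w, ref_logp_l, beta=0.1):
    margin = (logp_w - ref_logp_w) - (logp_l - ref_logp_l)
    return -math.log(1.0 / (1.0 + math.exp(-beta * margin)))

# Policy already prefers the chosen response more than the reference does,
# so the margin is positive and the loss is below log(2):
print(dpo_loss(logp_w=-5.0, logp_l=-9.0, ref_logp_w=-6.0, ref_logp_l=-6.0))
```

At zero margin the loss is exactly log(2); driving the margin positive (chosen response upweighted relative to the reference) is what lowers it.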
Reward model: A model trained to predict human preferences (or utility) for candidate outputs; used in RLHF-style pipelines.
LIME: A local surrogate explanation method that approximates a model's behavior near a specific input.
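The core idea is to sample perturbations around the input, weight them by proximity, and fit a simple weighted linear model whose coefficients explain the black box locally. A minimal one-dimensional sketch of that idea (not the LIME library's API; the kernel width, sample count, and black-box function are illustrative):

```python
import math
import random

# Sketch of the local-surrogate idea: perturb around x0, weight samples
# by a Gaussian proximity kernel, and fit a weighted linear model whose
# slope approximates the black box's local behavior.

def local_slope(black_box, x0, width=0.1, n=200, seed=0):
    rng = random.Random(seed)
    xs = [x0 + rng.gauss(0, width) for _ in range(n)]
    ws = [math.exp(-((x - x0) ** 2) / (2 * width ** 2)) for x in xs]
    ys = [black_box(x) for x in xs]
    # Weighted least squares for the slope of y ~ a + b * (x - x0):
    sw = sum(ws)
    mx = sum(w * (x - x0) for w, x in zip(ws, xs)) / sw
    my = sum(w * y for w, y in zip(ws, ys)) / sw
    num = sum(w * ((x - x0) - mx) * (y - my) for w, x, y in zip(ws, xs, ys))
    den = sum(w * ((x - x0) - mx) ** 2 for w, x in zip(ws, xs))
    return num / den

# Near x0 = 1, f(x) = x**2 behaves like a line of slope 2 * x0 = 2:
print(round(local_slope(lambda x: x * x, 1.0), 1))
```

The surrogate is only trustworthy near x0; the proximity kernel is what keeps the fit local.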
Curriculum learning: Ordering training samples from easier to harder to improve convergence or generalization.
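Concretely, a curriculum is just a difficulty-keyed ordering applied before batching. A minimal sketch using sequence length as the difficulty proxy (a common but illustrative choice; real curricula may use loss, label noise estimates, or hand-designed stages):

```python
# Sketch: order training samples easiest-first before batching.
# Here "difficulty" is approximated by text length.

samples = ["a cat", "the quick brown fox jumps", "hi", "dogs bark loudly"]

def curriculum_order(samples):
    return sorted(samples, key=len)  # easiest (shortest) first

print(curriculum_order(samples))
# ['hi', 'a cat', 'dogs bark loudly', 'the quick brown fox jumps']
```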
Quantization: Reducing the numeric precision of weights and activations to speed up inference and reduce memory, with acceptable accuracy loss.
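The simplest scheme is affine (asymmetric) int8 quantization: map the float range [min, max] onto integers 0..255 and store only the integers plus a scale and offset. A minimal sketch with illustrative weights:

```python
# Sketch: affine int8 quantization of a weight vector, then dequantization.
# The round-trip error (at most half a quantization step) is the
# "acceptable accuracy loss" the definition refers to.

def quantize(ws):
    lo, hi = min(ws), max(ws)
    scale = (hi - lo) / 255 or 1.0             # guard against constant input
    q = [round((w - lo) / scale) for w in ws]  # integers in 0..255
    return q, scale, lo

def dequantize(q, scale, lo):
    return [qi * scale + lo for qi in q]

weights = [-0.51, 0.0, 0.27, 1.02]
q, scale, lo = quantize(weights)
restored = dequantize(q, scale, lo)
print(max(abs(w - r) for w, r in zip(weights, restored)))  # at most scale / 2
```

Storing 8-bit integers instead of 32-bit floats cuts memory 4x; the error per weight is bounded by half the quantization step.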
Pruning: Removing weights or neurons to shrink models and improve efficiency; can be structured or unstructured.
Convex optimization: Optimization problems where any local minimum is also a global minimum.
Backdoor attack: Hidden behavior activated by specific triggers, causing targeted mispredictions or undesired outputs.
Non-convex optimization: Optimization with multiple local minima and saddle points; typical of neural network training.
Second-order optimization: Optimization using curvature information (e.g., the Hessian); often too expensive at scale.
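The canonical example is Newton's method, which rescales the gradient by the curvature: x ← x − f′(x)/f″(x). A minimal one-dimensional sketch on an illustrative quadratic, where a single step lands exactly on the minimum:

```python
# Sketch: a Newton step uses the second derivative to rescale the gradient.
# For the quadratic f(x) = (x - 3)**2, one step reaches the minimum exactly.

def newton_step(x, grad, hess):
    return x - grad(x) / hess(x)

grad = lambda x: 2 * (x - 3)  # f'(x)
hess = lambda x: 2.0          # f''(x), constant for a quadratic

print(newton_step(10.0, grad, hess))  # 3.0
```

In high dimensions the Hessian has as many entries as parameters squared, which is why exact second-order methods rarely scale to large networks.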
Scaling laws: Empirical laws linking model size, data, and compute to performance.
Variational autoencoder (VAE): An autoencoder using probabilistic latent variables and KL-divergence regularization.
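The KL regularizer has a closed form when the posterior is a diagonal Gaussian N(mu, sigma²) and the prior is a standard normal: KL = ½ Σ (mu² + sigma² − 1 − log sigma²) over latent dimensions. A minimal sketch of just that term (the reconstruction term and encoder/decoder are omitted):

```python
import math

# Sketch: closed-form KL divergence from a diagonal Gaussian posterior
# N(mu, sigma^2) to a standard normal prior, parameterized (as is common)
# by log-variance for numerical stability.

def kl_to_standard_normal(mu, log_var):
    return 0.5 * sum(m * m + math.exp(lv) - 1.0 - lv
                     for m, lv in zip(mu, log_var))

# A posterior equal to the prior (mu = 0, sigma = 1) gives zero KL:
print(kl_to_standard_normal(mu=[0.0, 0.0], log_var=[0.0, 0.0]))  # 0.0
```

This term pulls each latent dimension toward the prior, which is what makes sampling from the trained latent space meaningful.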
Mode collapse: A failure mode of GAN training in which the generator produces only a limited variety of outputs.
Image classification: Assigning category labels to images.
Vision-language model: A joint model aligning images and text in a shared embedding space (e.g., CLIP).
Vocoder: Generates audio waveforms from spectrograms.
Gradient: The direction of steepest ascent of a function.
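Each component of the gradient is a partial derivative, which can be checked numerically with central finite differences. A minimal sketch on an illustrative function f(x, y) = x² + 3y, whose analytic gradient is (2x, 3):

```python
# Sketch: numerical gradient via central finite differences,
#   df/dx_i ≈ (f(x + eps*e_i) - f(x - eps*e_i)) / (2*eps).
# This is also the standard way to sanity-check hand-derived gradients.

def numerical_grad(f, point, eps=1e-6):
    grads = []
    for i in range(len(point)):
        plus = list(point);  plus[i] += eps
        minus = list(point); minus[i] -= eps
        grads.append((f(plus) - f(minus)) / (2 * eps))
    return grads

f = lambda p: p[0] ** 2 + 3 * p[1]
print([round(g, 4) for g in numerical_grad(f, [2.0, 1.0])])  # [4.0, 3.0]
```

Gradient descent simply steps against this direction; gradient ascent steps along it.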
Feedback loop: Using production outcomes to improve models.
Plateau: A flat, high-dimensional region of the loss surface that slows training.