Results for "trial-and-error"

134 results

Underfitting Intermediate

When a model cannot capture underlying structure, performing poorly on both training and test data.

Foundations & Theory
F1 Score Intermediate

Harmonic mean of precision and recall; useful when balancing false positives/negatives matters.

Foundations & Theory
Log Loss Intermediate

Penalizes confident wrong predictions heavily; standard for classification and language modeling.

Optimization
Momentum Intermediate

Uses an exponential moving average of gradients to speed convergence and reduce oscillation.

Optimization
Adam Intermediate

Popular optimizer combining momentum and per-parameter adaptive step sizes via first/second moment estimates.

Optimization
Batch Size Intermediate

Number of samples per gradient update; impacts compute efficiency, generalization, and stability.

Foundations & Theory
ReLU Intermediate

Activation max(0, x); improves gradient flow and training speed in deep nets.

Foundations & Theory
Normalization Intermediate

Techniques that stabilize and speed training by normalizing activations; LayerNorm is common in Transformers.

Foundations & Theory
Dropout Intermediate

Randomly zeroing activations during training to reduce co-adaptation and overfitting.

Foundations & Theory
LSTM Intermediate

An RNN variant using gates to mitigate vanishing gradients and capture longer context.

Foundations & Theory
Attention Intermediate

Mechanism that computes context-aware mixtures of representations; scales well and captures long-range dependencies.

Transformers & LLMs
Tokenization Intermediate

Converting text into discrete units (tokens) for modeling; subword tokenizers balance vocabulary size and coverage.

Foundations & Theory
Vocabulary Intermediate

The set of tokens a model can represent; impacts efficiency, multilinguality, and handling of rare strings.

Transformers & LLMs
Large Language Model Intermediate

A high-capacity language model trained on massive corpora, exhibiting broad generalization and emergent behaviors.

Large Language Models
Prompt Intermediate

The text (and possibly other modalities) given to an LLM to condition its output behavior.

Prompting & Instructions
Prompt Engineering Intermediate

Crafting prompts to elicit desired behavior, often using role, structure, constraints, and examples.

Prompting & Instructions
RAG Intermediate

Architecture that retrieves relevant documents (e.g., from a vector DB) and conditions generation on them to reduce hallucinations.

Foundations & Theory
Semantic Search Intermediate

Retrieval based on embedding similarity rather than keyword overlap, capturing paraphrases and related concepts.

Foundations & Theory
Hallucination Intermediate

Model-generated content that is fluent but unsupported by evidence or incorrect; mitigated by grounding and verification.

Model Failure Modes
RLHF Intermediate

Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.

Optimization
Alignment Intermediate

Ensuring model behavior matches human goals, norms, and constraints, including reducing harmful or deceptive outputs.

Foundations & Theory
Guardrails Intermediate

Rules and controls around generation (filters, validators, structured outputs) to reduce unsafe or invalid behavior.

Reinforcement Learning
Bias Intermediate

Systematic differences in model outcomes across groups; arises from data, labels, and deployment context.

Foundations & Theory
Explainability Intermediate

Techniques to understand model decisions (global or local), important in high-stakes and regulated settings.

Foundations & Theory
Causal Inference Intermediate

Framework for reasoning about cause-effect relationships beyond correlation, often using structural assumptions and experiments.

Foundations & Theory
Confounding Intermediate

A hidden variable influences both cause and effect, biasing naive estimates of causal impact.

Foundations & Theory
Data Labeling Intermediate

Human or automated process of assigning targets; quality, consistency, and guidelines matter heavily.

Foundations & Theory
Synthetic Data Intermediate

Artificially created data used to train/test models; helpful for privacy and coverage, risky if unrealistic.

Foundations & Theory
Encryption in Transit/At Rest Intermediate

Protecting data during network transfer and while stored; essential for ML pipelines handling sensitive data.

Security & Privacy
PII Intermediate

Information that can identify an individual (directly or indirectly); requires careful handling and compliance.

Foundations & Theory