Domain: Foundations & Theory

127 terms

Plant Intermediate

The physical system being controlled.

Positional Encoding Intermediate

Injects sequence order into Transformers, since attention alone is permutation-invariant.

Precision Intermediate

Of predicted positives, the fraction that are truly positive; sensitive to false positives.

Privacy Attack Intermediate

Attacks that infer whether specific records were in training data, or reconstruct sensitive training examples.

Prompt Injection Intermediate

Attacks that manipulate model instructions (especially via retrieved content) to override system goals or exfiltrate data.

Pruning Intermediate

Removing weights or neurons to shrink models and improve efficiency; can be structured or unstructured.

Quantization Intermediate

Reducing numeric precision of weights/activations to speed inference and reduce memory with acceptable accuracy loss.

RAG Intermediate

Architecture that retrieves relevant documents (e.g., from a vector DB) and conditions generation on them to reduce hallucinations.

Recall Intermediate

Of true positives, the fraction correctly identified; sensitive to false negatives.

Regularization Intermediate

Techniques that discourage overly complex solutions to improve generalization (reduce overfitting).

ReLU Intermediate

Activation max(0, x); improves gradient flow and training speed in deep nets.

Reproducibility Intermediate

Ability to replicate results given same code/data; harder in distributed training and nondeterministic ops.

Reward Model Intermediate

Model trained to predict human preferences (or utility) for candidate outputs; used in RLHF-style pipelines.

Robust Control Intermediate

Control that remains stable under model uncertainty.

ROC Curve Intermediate

Plots true positive rate vs false positive rate across thresholds; summarizes separability.

Saddle Plateau Intermediate

Flat high-dimensional regions slowing training.

Safety Filter Intermediate

Automated detection/prevention of disallowed outputs (toxicity, self-harm, illegal instruction, etc.).

Sampling Intermediate

Stochastic generation strategies that trade determinism for diversity; key knobs include temperature and nucleus sampling.

Secure Inference Intermediate

Methods to protect model/data during inference (e.g., trusted execution environments) from operators/attackers.

Semantic Search Intermediate

Retrieval based on embedding similarity rather than keyword overlap, capturing paraphrases and related concepts.

SFT Intermediate

Fine-tuning on (prompt, response) pairs to align a model with instruction-following behaviors.

SHAP Intermediate

Feature attribution method grounded in cooperative game theory for explaining predictions in tabular settings.

Softmax Intermediate

Converts logits to probabilities by exponentiation and normalization; common in classification and LMs.

Specificity Intermediate

Of true negatives, the fraction correctly identified.

Stability Intermediate

System returns to equilibrium after disturbance.

Stochastic Approximation Intermediate

Optimization under uncertainty.

Stochastic Gradient Descent Intermediate

A gradient method using random minibatches for efficient training on large datasets.

Structured Output Intermediate

Forcing predictable formats for downstream systems; reduces parsing errors and supports validation/guardrails.

Synthetic Data Intermediate

Artificially created data used to train/test models; helpful for privacy and coverage, risky if unrealistic.

Temperature Intermediate

Scales logits before sampling; higher increases randomness/diversity, lower increases determinism.