Results for "standardized evaluation"
Model Card
Intermediate
Standardized documentation describing intended use, performance, limitations, data, and ethical considerations.
Data Leakage
Intermediate
When information from evaluation data improperly influences training, inflating reported performance.
Cross-Validation
Intermediate
A robust evaluation technique that trains/evaluates across multiple splits to estimate performance variability.