2 results
When information from evaluation data improperly influences training, inflating reported performance.
A robust evaluation technique that trains/evaluates across multiple splits to estimate performance variability.