Eval Harness
Intermediate
Full Definition
A system for running consistent evaluations across tasks, versions, prompts, and model settings, so that results from different runs can be compared directly.
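To make the definition concrete, here is a minimal sketch of such a harness in Python. It is illustrative only and not taken from any particular library: the names `Task`, `Settings`, `run_harness`, and `exact_match`, the toy models, and the exact-match scoring rule are all assumptions introduced for this example. The point it tries to show is that every combination of task, version, prompt, and setting goes through the same loop and the same scoring function.

```python
from dataclasses import dataclass
from itertools import product
from typing import Callable, Dict, List, Tuple


@dataclass(frozen=True)
class Task:
    """A named task with a non-empty set of (input, expected output) pairs."""
    name: str
    examples: Tuple[Tuple[str, str], ...]


@dataclass(frozen=True)
class Settings:
    """Hypothetical model settings; only temperature is modeled here."""
    temperature: float = 0.0


# Assumption: a "model" is any callable taking (prompt text, settings).
Model = Callable[[str, Settings], str]
ResultKey = Tuple[str, str, str, float]


def exact_match(prediction: str, expected: str) -> bool:
    """Score one prediction; whitespace at the ends is ignored."""
    return prediction.strip() == expected.strip()


def run_harness(
    tasks: List[Task],
    models: Dict[str, Model],      # version label -> model
    prompts: Dict[str, str],       # prompt name -> template containing {input}
    settings_list: List[Settings],
) -> Dict[ResultKey, float]:
    """Run every task/version/prompt/settings combination the same way."""
    results: Dict[ResultKey, float] = {}
    grid = product(tasks, models.items(), prompts.items(), settings_list)
    for task, (version, model), (prompt_name, template), settings in grid:
        correct = sum(
            exact_match(model(template.format(input=x), settings), y)
            for x, y in task.examples
        )
        key = (task.name, version, prompt_name, settings.temperature)
        results[key] = correct / len(task.examples)  # accuracy in [0, 1]
    return results


# Toy stand-ins for two model versions (no real model is called).
def echo_model(prompt: str, settings: Settings) -> str:
    return prompt.split(":")[-1].strip()


def upper_model(prompt: str, settings: Settings) -> str:
    return prompt.split(":")[-1].strip().upper()


tasks = [Task("uppercase", (("abc", "ABC"), ("Hi", "HI")))]
models = {"v0": echo_model, "v1": upper_model}
prompts = {"plain": "Input: {input}", "instruct": "Uppercase this: {input}"}

results = run_harness(tasks, models, prompts, [Settings(temperature=0.0)])
for key, accuracy in sorted(results.items()):
    print(key, accuracy)
```

Because the result keys record the task, version, prompt, and setting, two runs can be compared cell by cell. A real harness would typically add more scoring functions, logging, and caching, but the fixed grid and the shared scoring path are the core idea.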