Eval Harness

Intermediate

System for running consistent evaluations across tasks, versions, prompts, and model settings.

Full Definition

System for running consistent evaluations across tasks, versions, prompts, and model settings.

Keywords

Domains

Related Terms

Concept Map

See how Eval Harness connects to other concepts.

Open Knowledge Graph