SFT

Intermediate

Fine-tuning on (prompt, response) pairs to align a model with instruction-following behaviors.

Full Definition

Fine-tuning on (prompt, response) pairs to align a model with instruction-following behaviors.

Keywords

Domains

Related Terms

Concept Map

See how SFT connects to other concepts.

Open Knowledge Graph