SFT
IntermediateFine-tuning on (prompt, response) pairs to align a model with instruction-following behaviors.
Full Definition
Fine-tuning on (prompt, response) pairs to align a model with instruction-following behaviors.
Keywords
Domains
Related Terms
Concept Map
See how SFT connects to other concepts.
Open Knowledge Graph