Results for "fine-tuning"
SFT
Intermediate
Supervised fine-tuning: training a pretrained model on (prompt, response) pairs so that it learns instruction-following behavior.
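A minimal sketch of how an SFT objective over one (prompt, response) pair might be computed, using NumPy. It assumes the common (but not universal) choice of averaging the next-token cross-entropy over response tokens only; the function name `sft_loss` and the `response_mask` argument are hypothetical, chosen for illustration.

```python
import numpy as np

def sft_loss(logits, targets, response_mask):
    """Mean negative log-likelihood over response tokens only.

    logits:        (T, V) unnormalized scores for each position
    targets:       (T,)   target token ids
    response_mask: (T,)   1 for response tokens, 0 for prompt tokens
                          (assumption: prompt tokens are excluded from the loss)
    """
    # Numerically stable log-softmax over the vocabulary axis.
    shifted = logits - logits.max(axis=-1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))

    # Per-token negative log-likelihood of the target token.
    token_nll = -log_probs[np.arange(len(targets)), targets]

    # Average only over positions that belong to the response.
    mask = response_mask.astype(float)
    return (token_nll * mask).sum() / mask.sum()

# Toy example: 5 positions, vocabulary of 4, the first 2 positions are the prompt.
logits = np.zeros((5, 4))              # uniform predictions
targets = np.array([0, 1, 2, 3, 0])
response_mask = np.array([0, 0, 1, 1, 1])
loss = sft_loss(logits, targets, response_mask)
```

With uniform predictions over 4 tokens, the loss equals log(4) regardless of which positions are masked, which makes the toy case easy to check by hand.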
Parameter-Efficient Fine-Tuning
Intermediate
Techniques that train a small number of added or selected parameters while leaving most pretrained weights frozen, reducing compute and storage costs.
LoRA
Intermediate
Low-Rank Adaptation: a PEFT method that freezes the pretrained weights and injects trainable low-rank matrices into selected layers, so that only those matrices are updated during fine-tuning.
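A minimal NumPy sketch of the LoRA idea for a single linear layer. The frozen weight `W` is left untouched, and the trainable update is the product of two small matrices `B` and `A` of rank `r`, scaled by `alpha / r`. The dimensions, the initialization values, and the helper name `lora_forward` are assumptions chosen for illustration, not any particular library's API.

```python
import numpy as np

def lora_forward(x, W, A, B, alpha):
    """Compute y = x W^T + (alpha / r) * x A^T B^T.

    W: (d_out, d_in) frozen pretrained weight
    A: (r, d_in)     trainable low-rank factor
    B: (d_out, r)    trainable low-rank factor
    """
    r = A.shape[0]
    return x @ W.T + (alpha / r) * (x @ A.T) @ B.T

rng = np.random.default_rng(0)
d_in, d_out, r = 64, 32, 4

W = rng.normal(size=(d_out, d_in))           # frozen during fine-tuning
A = rng.normal(scale=0.01, size=(r, d_in))   # small random init
B = np.zeros((d_out, r))                     # zero init, so the update starts at zero

x = rng.normal(size=(8, d_in))               # batch of 8 inputs
y = lora_forward(x, W, A, B, alpha=8.0)

full_params = W.size            # parameters updated by full fine-tuning
lora_params = A.size + B.size   # parameters updated by LoRA
```

In this toy setting the layer has 2048 weights but only 384 trainable LoRA parameters, and because `B` starts at zero the adapted layer initially reproduces the pretrained layer's output.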
Fine-Tuning
Intermediate
Updating a pretrained model’s weights on task-specific data to improve performance or adapt style/behavior.