SFT

Intermediate

Fine-tuning on (prompt, response) pairs to align a model with instruction-following behaviors.

AdvertisementAd space — term-top

Why It Matters

SFT is important because it helps AI models become more aligned with user expectations and instructions. By training models to follow specific prompts, industries can create AI systems that provide more relevant and accurate information, improving user experience and satisfaction.

Supervised Fine-Tuning (SFT) is a method employed in the training of machine learning models, particularly in natural language processing, where the model is fine-tuned on a dataset consisting of pairs of prompts and corresponding responses. This approach aims to align the model's behavior with specific instruction-following capabilities, enhancing its ability to generate relevant and contextually appropriate outputs. Mathematically, SFT involves optimizing the model parameters to minimize a loss function that measures the discrepancy between the predicted responses and the ground truth responses provided in the training data. SFT is a critical step in the development of models that require adherence to user instructions and preferences, serving as a bridge between general language understanding and task-specific performance.

Keywords

Domains

Related Terms

Welcome to AI Glossary

The free, self-building AI dictionary. Help us keep it free—click an ad once in a while!

Search

Type any question or keyword into the search bar at the top.

Browse

Tap a letter in the A–Z bar to browse terms alphabetically, or filter by domain, industry, or difficulty level.

3D WordGraph

Fly around the interactive 3D graph to explore how AI concepts connect. Click any word to read its full definition.