Kubnal Bridge

Techniques & Methods

Supervised Fine-Tuning

Supervised fine-tuning (SFT) takes a pre-trained foundation model and continues training it on a curated dataset of input-output pairs specific to the target task or domain. This adapts the model's behavior without training from scratch, preserving general capabilities while adding specialization.

SFT is the first step in RLHF pipelines for aligning LLMs. It teaches the model to follow instructions and produce task-appropriate formats before reinforcement learning further refines behavior using human preference data.

Authority Links

Related Terms