The largest independent dev blog feed.

We surface the best developer writing from thousands of independent blogs, updated daily. The open web is worth fighting for.

Learn more

Reimagining Supervised Fine-Tuning for Large Language Models and Diffusion from a Denoising Perspective

1 · Norm Inui · May 18, 2024, midnight

Summary

TL; DR 🚧 (WIP) In a recent study by Gekhman, Zorik, et al., the authors reveal that supervised fine-tuning (SFT) struggles to integrate new knowledge into Large Language Model (LLM). The authors propose an approach by categorizing fine-tuning datasets into four categories as below: Their findings suggest that the most effective fine-tuning datasets should consist of HighlyKnown, MaybeKnown, and WeaklyKnown data, with minimal amount of Unknown data. This conclusion makes me rethink the natur...

Read full post on →

AUTHOR