Basics of Transformers and Huggingface - Training

1 · · Jan. 22, 2024, 6:30 p.m.
Now you know what transformer does, how it knows what it knows. We combine these both now. Let’s train a model! What you need Dataset The whole point of training something is to do something we want. Here, we assume that we the model to give good summaries, or even better, good summaries of scientific articles. No matter that is your task, you create an appropriate dataset for it. The model Now that we have the data, we need to train the model. I’ll be using Bart for this. Bart is an ~400M param...