Powered by GitBook

1 of 1

Loading...

Week 7 - Fine-Tuning I

This week you will...

know how to prepare the data for training LLMs.
get a better technical understanding of how to train LLMs.
learn about different alignment approaches such as RLHF and RLAIF using PPO, or DPO.

Learning Resources

from the Hugging Face NLP course.
by Andrej Karpathy explaining how to train a GPT from scratch.

Until next week you should...

and of the course Generative AI with Large Language Models.
specify which metrics you'll use to evaluate the model performance, and why you've chosen these metrics. Document it in the corresponding section of your project repository.

5MB

231204_Fine-Tuning I.pdf

PDF

Training a causal language model from scratch