Week 7 - Fine-Tuning I
This week you will...
know how to prepare the data for training LLMs.
get a better technical understanding of how to train LLMs.
learn about different alignment approaches such as RLHF and RLAIF using PPO, or DPO.
Learning Resources
Training a causal language model from scratch from the Hugging Face NLP course.
Video by Andrej Karpathy explaining how to train a GPT from scratch.
Until next week you should...
Last updated