Week 7 - Fine-Tuning I
Know how to prepare data for training LLMs.
Get a better technical understanding of how LLMs are trained.
Learn about different alignment approaches, such as RLHF and RLAIF (both typically optimized with PPO) and DPO.
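To make the difference between the alignment approaches named above more concrete: PPO-based methods train a separate reward model and then run reinforcement learning, while DPO optimizes a simple classification-style loss directly on preference pairs. The snippet below is a minimal sketch of the per-example DPO loss, not a training recipe. The function name `dpo_loss`, its argument names, and the default `beta=0.1` are illustrative assumptions; the inputs are assumed to be summed log-probabilities of the chosen and rejected responses under the policy being trained and under a frozen reference model.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss (illustrative sketch; names are hypothetical).

    Each argument is the summed log-probability of a full response given
    the prompt. `beta` controls how strongly the policy is kept close to
    the reference model.
    """
    # How much more (in log space) the policy likes each response
    # compared with the reference model.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp

    # DPO pushes this scaled difference to be large and positive.
    z = beta * (chosen_margin - rejected_margin)

    # loss = -log(sigmoid(z)), written in a numerically stable form.
    if z >= 0:
        return math.log1p(math.exp(-z))
    return -z + math.log1p(math.exp(z))


# Policy identical to the reference: the loss equals log(2).
baseline = dpo_loss(-2.0, -2.0, -2.0, -2.0)

# Policy that prefers the chosen response more than the reference does:
# the loss drops below the baseline.
improved = dpo_loss(-1.0, -3.0, -2.0, -2.0)
```

In practice, libraries such as Hugging Face TRL compute this loss over batches of token-level log-probabilities; the scalar version here only illustrates the shape of the objective.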
"Training a causal language model from scratch", a chapter of the Hugging Face NLP course.
Video by Andrej Karpathy explaining how to train a GPT from scratch.