Week 7 - Fine-Tuning I

This week you will...

  • learn how to prepare data for training LLMs.

  • get a better technical understanding of how to train LLMs.

  • learn about different alignment approaches, such as RLHF and RLAIF (using PPO) and DPO.

