opencampus.sh Machine Learning Program

CtrlK

Week 7 - Fine-Tuning I

This week you will...

know how to prepare the data for training LLMs.
get a better technical understanding of how to train LLMs.
learn about different alignment approaches such as RLHF and RLAIF using PPO, or DPO.

Learning Resources

231204_Fine-Tuning I.pdf

Training a causal language model from scratch from the Hugging Face NLP course.
Video by Andrej Karpathy explaining how to train a GPT from scratch.

Until next week you should...

week 2 and week 3 of the course Generative AI with Large Language Models.
specify which metrics you'll use to evaluate the model performance, and why you've chosen these metrics. Document it in the corresponding section of your project repository.

PreviousWeek 6 - Model Evaluation NextWeek 8 - Fine-Tuning II and Model Inference

Last updated 1 year ago

Was this helpful?