Week 7 - Fine-Tuning I
By the end of this week, you will:
know how to prepare data for training LLMs.
have a better technical understanding of how LLMs are trained.
have learned about different alignment approaches, such as RLHF and RLAIF (using PPO) and DPO.
from the Hugging Face NLP course.
by Andrej Karpathy explaining how to train a GPT from scratch.