# Week 6 - Model Evaluation

### This week you will...

* get to know Weights & Biases a popular platform to evaluate deep learning models.
* understand different evaluation metrics.
* get a high level introduction into training LLMs.

### Learning Resources

{% file src="<https://4020123021-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F-MHobCAnoTQkN71lOgdv%2Fuploads%2FPFgO43eHV33ySwU8RPPE%2F231127_Model%20Evaluation.pdf?alt=media&token=ee6337fb-cdfa-49fc-a819-0eb5cb8a3c1d>" %}

* [short course](https://www.deeplearning.ai/short-courses/evaluating-debugging-generative-ai/) by Deeplearning.AI and Weights & Biases on how to use the Weights & Biases framework to track and evaluate your model results

### Until next week you should...

* [x] watch [this video](https://youtu.be/-RPeakdlHYo) on data collators.
* [x] work through the section [Training a causal language model from scratch](https://huggingface.co/learn/nlp-course/en/chapter7/6) from the Hugging Face NLP course (do not change to the TensorFlow version of the site).
* [x] Decide on a baseline model and add a short description of your approach in the corresponding section of your project repository.
