Fine-Tuning

Course module.

Module Contents

Chapter 01

Master the fundamentals of Fine-Tuning large language models, including Supervised Fine-Tuning, Instruction Tuning, and Catastrophic Forgetting.

Chapter 02

Coming soon

Chapter 03

Learn how RLHF works, from the reward model to the PPO algorithm, to align language models with human preferences.

Chapter 04

Review & Cheat Sheet

Mark it as mastered to track your progress through the course.