Fine-Tuning

Course module.

Practice in the Vault

Module Contents

1. Fine-Tuning Basics

2. LoRA and PEFT

3. RLHF

Chapter 01

Fine-Tuning Basics

Master the fundamentals of Fine-Tuning large language models, including Supervised Fine-Tuning, Instruction Tuning, and Catastrophic Forgetting.

Start Learning
Chapter 03

RLHF

Learn how RLHF works, from the reward model to the PPO algorithm, to align language models with human preferences.

Start Learning