Knowledge Quiz
Test your understanding of this article
1.What is the primary innovation introduced by TinyLoRA?
2.How many parameters did TinyLoRA use to train the 8B parameter Qwen2.5 model to 91% accuracy on GSM8K?
3.What training method was crucial for TinyLoRA to achieve strong performance, according to the article?
4.Compared to models trained with Supervised Fine-Tuning (SFT), how many more updates did SFT models require to reach similar performance as TinyLoRA with RL?
