Paper Weekly 18: The Flan Collection: Designing Data and Methods for Effective Instruction Tuning

We find task balancing and enrichment techniques are overlooked but critical to effective instruction tuning, and in particular, training with mixed prompt settings (zero-shot, few-shot, and chain-of-thought) actually yields stronger (2%+) performance in all settings.

A classic instruction-tuning data collection; the main conclusions are as follows:
- Mixing zero-shot and few-shot prompts during training yields better results in both settings.
- Validates the effectiveness of several techniques: scaling the number of tasks, enriching task variety with input inversion, adding chain-of-thought (CoT) data, and balancing different data sources (see the input-inversion sketch after this list).
- Demonstrates that these design choices yield 3-17% held-out task improvements over existing open-source instruction-tuning collections.
- Demonstrates that Flan-T5 serves as a stronger and more computationally efficient starting checkpoint for single-task finetuning.
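
To make "input inversion" concrete, here is a minimal sketch in Python, assuming simple dict-based task records; the field names and template string are hypothetical illustrations, not the Flan codebase's actual format.

```python
# Minimal sketch of input inversion: create a new task by asking the model to
# produce the original input given the original target. Field names and the
# template string are hypothetical, not the Flan codebase's actual format.

def invert_example(example: dict) -> dict:
    """Turn an (input -> target) example into a (target -> input) task."""
    return {
        "input": f"Write a question whose answer is: {example['target']}",
        "target": example["input"],
    }

qa = {"input": "What is the capital of France?", "target": "Paris"}
print(invert_example(qa))
# {'input': 'Write a question whose answer is: Paris',
#  'target': 'What is the capital of France?'}
```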

In fact, the paper is written so clearly that its figures and tables alone are almost enough to get the main points.

Somewhat counterintuitively, mixing zero-shot and few-shot data improves evaluation results in both settings. Or, in hindsight, perhaps this is not counterintuitive at all: in either setting the training data is never sufficient on its own, so the other format effectively acts as extra augmentation.
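
A minimal sketch of what this mixing looks like at the data level: the same pool of examples is rendered either zero-shot or few-shot (with exemplars prepended) at templating time, so one task contributes training data in both formats. The helper names and the 50% few-shot ratio here are assumptions for illustration, not the paper's exact mixture.

```python
import random

# Hypothetical sketch of mixed prompt settings: the zero-shot/few-shot choice
# is made per example at templating time, so the same task contributes
# training data in both formats. Names and the 50% ratio are illustrative.

def render(example: dict, exemplars: list, k: int) -> str:
    """Render an example zero-shot (k=0) or few-shot (k>0)."""
    shots = "".join(
        f"Q: {ex['input']}\nA: {ex['target']}\n\n" for ex in exemplars[:k]
    )
    return f"{shots}Q: {example['input']}\nA:"

def mixed_prompt(example: dict, exemplars: list,
                 few_shot_ratio: float = 0.5, max_k: int = 3) -> str:
    """Randomly pick a prompt setting for this training example."""
    k = random.randint(1, max_k) if random.random() < few_shot_ratio else 0
    return render(example, exemplars, k)

pool = [
    {"input": "2 + 2 = ?", "target": "4"},
    {"input": "Capital of Japan?", "target": "Tokyo"},
]
print(mixed_prompt({"input": "Capital of France?", "target": "Paris"}, pool))
```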

After scaling up the number of tasks, models with more parameters reach their best performance earlier; held-in tasks already show overfitting, while held-out tasks are still underfit.

Finetuning the instruction-tuned T5 (Flan-T5) on a specific dataset also converges faster than finetuning vanilla T5.
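
A minimal sketch of using Flan-T5 as the starting checkpoint for single-task finetuning with Hugging Face transformers; swapping the checkpoint name for `t5-base` gives the vanilla-T5 baseline. The toy summarization pair below is illustrative only.

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Start single-task finetuning from the instruction-tuned checkpoint rather
# than vanilla T5; swap in "t5-base" to reproduce the baseline comparison.
model_name = "google/flan-t5-base"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

# Any standard seq2seq finetuning loop applies from here; the single
# (input, target) pair below is illustrative only.
inputs = tokenizer(
    "summarize: The Flan Collection studies data and methods for "
    "effective instruction tuning.",
    return_tensors="pt",
)
labels = tokenizer("A study of instruction tuning.",
                   return_tensors="pt").input_ids
loss = model(**inputs, labels=labels).loss
loss.backward()  # one illustrative backward pass; plug into an optimizer/Trainer
```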

The related work section of this paper is also very well written; kudos.