Qinyu Chen
Data
Paper-Weekly21-Best Practices and Lessons Learned on Synthetic Data for Language Models
The success of AI models relies on the availability of large, diverse, and high-quality datasets, which can be challenging to obtain due to data scarcity, privacy concerns, and high costs.
Mar 11, 2024
1 min read
Paper-Weekly18-The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
We find task balancing and enrichment techniques are overlooked but critical to effective instruction tuning, and in particular, training with mixed prompt settings (zero-shot, few-shot, and chain-of-thought) actually yields stronger (2%+) performance in all settings.
Jan 26, 2024
1 min read