OFA-Sys / DiverseEvolLinks
Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning
☆83Updated last year
Alternatives and similar repositories for DiverseEvol
Users that are interested in DiverseEvol are comparing it to the libraries listed below
Sorting:
- Reformatted Alignment☆113Updated 11 months ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆188Updated last year
- ☆50Updated last year
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆53Updated 9 months ago
- Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models☆88Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆169Updated 2 months ago
- ☆50Updated last year
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆69Updated 3 months ago
- ☆103Updated 8 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆33Updated last year
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆257Updated last month
- Counting-Stars (★)☆83Updated 2 months ago
- Code implementation of synthetic continued pretraining☆123Updated 7 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆58Updated 10 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆51Updated last year
- ☆96Updated last year
- ☆36Updated last month
- ☆83Updated last year
- ☆145Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆146Updated 9 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆83Updated last year
- ☆95Updated 8 months ago
- ☆36Updated 11 months ago
- ☆104Updated last month
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆63Updated 11 months ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆69Updated last week
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆96Updated 2 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆51Updated 2 months ago
- [ACL 2025] An official pytorch implement of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement☆34Updated 2 months ago
- [ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)☆163Updated 6 months ago