dwzq-com-cn / DongwuLLM
This is the codebase for pre-training, compressing, extending, and distilling LLMs with Megatron-LM.
☆12Updated 11 months ago
Alternatives and similar repositories for DongwuLLM:
Users who are interested in DongwuLLM are comparing it to the libraries listed below.
- ☆96Updated 5 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆241Updated 2 months ago
- ☆99Updated 2 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆50Updated 5 months ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆60Updated 4 months ago
- Counting-Stars (★)☆80Updated 6 months ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆39Updated 11 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆144Updated 5 months ago
- L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding?☆23Updated 4 months ago
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆38Updated 5 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆135Updated 4 months ago
- ☆93Updated last year
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Updated last year
- The aim of this repository is to utilize LLaMA to reproduce and enhance the Stanford Alpaca☆96Updated last year
- Unofficial implementation of AlpaGasus☆90Updated last year
- [ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues☆71Updated 7 months ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆68Updated last week
- ☆44Updated 8 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆65Updated 3 months ago
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models☆96Updated 2 months ago
- Implementations of the online merging optimizers proposed in "Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment"☆74Updated 8 months ago
- ☆132Updated 10 months ago
- Code for our EMNLP-2023 paper: "Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks"☆24Updated last year
- Fantastic Data Engineering for Large Language Models☆75Updated 2 months ago
- Towards Systematic Measurement for Long Text Quality☆32Updated 6 months ago
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | Continual pre-training to improve …☆33Updated 2 months ago
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆176Updated 4 months ago
- ☆137Updated 2 months ago
- SysBench: Can Large Language Models Follow System Messages?☆24Updated 6 months ago
- Rethinking Negative Instances for Generative Named Entity Recognition☆49Updated 11 months ago