RUCAIBox / EASYEP
☆17Updated 3 weeks ago
Alternatives and similar repositories for EASYEP:
Users that are interested in EASYEP are comparing it to the libraries listed below
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection☆42Updated 6 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆119Updated 6 months ago
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | 继续预训练提升 …☆32Updated 4 months ago
- ☆63Updated 5 months ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆40Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated 11 months ago
- this is an implementation for the paper Improve Mathematical Reasoning in Language Models by Automated Process Supervision from google de…☆30Updated last month
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆149Updated 8 months ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆62Updated 6 months ago
- Towards Systematic Measurement for Long Text Quality☆34Updated 8 months ago
- LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models☆76Updated 6 months ago
- The code of arxiv paper: "CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis"☆24Updated 3 months ago
- ☆46Updated 10 months ago
- Code implementation of synthetic continued pretraining☆107Updated 4 months ago
- Fantastic Data Engineering for Large Language Models☆87Updated 4 months ago
- ☆22Updated 9 months ago
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings☆154Updated 10 months ago
- The official repository of the Omni-MATH benchmark.☆83Updated 4 months ago
- ☆57Updated 6 months ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆42Updated 10 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆48Updated 10 months ago
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆136Updated last month
- Official completion of “Training on the Benchmark Is Not All You Need”.☆31Updated 4 months ago
- Counting-Stars (★)☆82Updated 8 months ago
- [ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)☆131Updated 2 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆71Updated last week
- A prototype repo for hybrid training of pipeline parallel and distributed data parallel with comments on core code snippets. Feel free to…☆55Updated last year
- ☆98Updated 7 months ago
- ☆81Updated last year
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆71Updated this week