18907305772 / FuseAI
FuseAI Project
☆87Updated 4 months ago
Alternatives and similar repositories for FuseAI
Users who are interested in FuseAI are comparing it to the repositories listed below
- Reformatted Alignment☆113Updated 9 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆143Updated 9 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆157Updated 2 weeks ago
- ☆36Updated 9 months ago
- ☆50Updated last year
- ☆86Updated last month
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆57Updated 8 months ago
- Unofficial implementation of AlpaGasus☆91Updated last year
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆133Updated last year
- ☆40Updated last year
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆137Updated 11 months ago
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models☆78Updated last year
- ☆47Updated last week
- ☆94Updated 6 months ago
- An Open Math Pre-training Dataset with 370B Tokens.☆89Updated 2 months ago
- ☆121Updated last year
- ☆68Updated 3 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆60Updated 2 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆78Updated last year
- General Reasoner: Advancing LLM Reasoning Across All Domains☆141Updated last week
- ☆42Updated 8 months ago
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆82Updated this week
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆251Updated 2 weeks ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆144Updated 7 months ago
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”☆121Updated 5 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆85Updated last year
- RL Scaling and Test-Time Scaling (ICML'25)☆105Updated 5 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆37Updated 3 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆250Updated 6 months ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆102Updated 3 months ago