bigai-nlco / TokenSwift
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation
☆86Updated last month
Alternatives and similar repositories for TokenSwift:
Users that are interested in TokenSwift are comparing it to the libraries listed below
- ☆142Updated last month
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆171Updated last month
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models☆117Updated this week
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆137Updated 2 months ago
- Reformatted Alignment☆115Updated 6 months ago
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning☆152Updated this week
- ☆94Updated 4 months ago
- An Open Math Pre-trainng Dataset with 370B Tokens.☆55Updated 2 weeks ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆236Updated this week
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models☆40Updated 4 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆131Updated 10 months ago
- The demo, code and data of FollowRAG☆71Updated 4 months ago
- ☆111Updated last month
- ☆185Updated 2 months ago
- Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs☆160Updated this week
- Code for Paper: Teaching Language Models to Critique via Reinforcement Learning☆92Updated last week
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆60Updated this week
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆54Updated 6 months ago
- The official repository of the Omni-MATH benchmark.☆80Updated 4 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆99Updated last month
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆101Updated 2 months ago
- ☆30Updated 4 months ago
- ☆282Updated last month
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆92Updated last month
- A Comprehensive Survey on Long Context Language Modeling☆131Updated 3 weeks ago
- Repo of paper "Free Process Rewards without Process Labels"☆141Updated last month
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆68Updated 3 weeks ago
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆175Updated last month
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆172Updated last week
- FuseAI Project☆85Updated 2 months ago