bigai-nlco / TokenSwiftLinks
[ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation
☆101Updated 2 weeks ago
Alternatives and similar repositories for TokenSwift
Users that are interested in TokenSwift are comparing it to the libraries listed below
Sorting:
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models☆126Updated last week
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆103Updated this week
- MMR1: Advancing the Frontiers of Multimodal Reasoning☆159Updated 2 months ago
- Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs☆166Updated last week
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆179Updated 2 months ago
- ✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning☆133Updated 3 weeks ago
- ☆201Updated 3 months ago
- ☆102Updated last month
- ☆77Updated last month
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆127Updated last month
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆106Updated last month
- Test-time preferenece optimization (ICML 2025).☆128Updated 3 weeks ago
- ☆94Updated 5 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆93Updated this week
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆213Updated 2 weeks ago
- Reformatted Alignment☆113Updated 8 months ago
- The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"☆156Updated 2 months ago
- ☆150Updated last month
- A Comprehensive Survey on Long Context Language Modeling☆147Updated 2 weeks ago
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆184Updated 2 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆132Updated 11 months ago
- An Open Math Pre-trainng Dataset with 370B Tokens.☆87Updated last month
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆102Updated 4 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆151Updated last month
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory☆95Updated last month
- General Reasoner: Advancing LLM Reasoning Across All Domains☆117Updated this week
- ☆79Updated 3 weeks ago
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆115Updated last month
- ☆188Updated last month
- ☆83Updated last month