bigai-nlco / TokenSwift

[ICML 2025] TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation

☆110

Alternatives and similar repositories for TokenSwift

Users who are interested in TokenSwift are comparing it to the libraries listed below.

SuperGPQA / SuperGPQA
☆157 Updated 3 months ago
lzhxmu / CPPO
CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models
☆145 Updated last month
SkyworkAI / Skywork-MoE
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models
☆136 Updated last year
DAMO-NLP-SG / LongPO
[ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
☆38 Updated 5 months ago
MiroMindAsia / MiroMind-M1
☆84 Updated last week
GAIR-NLP / PC-Agent-E
Efficient Agent Training for Computer Use
☆120 Updated last month
GAIR-NLP / ReAlign
Reformatted Alignment
☆113 Updated 10 months ago
yyht / openrlhf_async_pipline
☆70 Updated this week
TIGER-AI-Lab / General-Reasoner
General Reasoner: Advancing LLM Reasoning Across All Domains
☆156 Updated last month
Tencent / llm.hunyuan.T1
☆77 Updated 3 months ago
JT-Ushio / MHA2MLA
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs
☆181 Updated last month
Bui1dMySea / MemLong
☆94 Updated 7 months ago
QwenLM / WorldPM
☆90 Updated 2 months ago
Quehry / HelloBench
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
☆46 Updated 8 months ago
THUDM / T1
RL Scaling and Test-Time Scaling (ICML'25)
☆109 Updated 6 months ago
Tongyi-Zhiwen / QwenLong-L1
☆287 Updated 2 months ago
MiniMax-AI / SynLogic
The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond
☆160 Updated 3 weeks ago
yafuly / TPO
Test-time preference optimization (ICML 2025).
☆149 Updated 2 months ago
TIGER-AI-Lab / CritiqueFineTuning
Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]
☆168 Updated 3 weeks ago
MoonshotAI / Kimi-Researcher
☆67 Updated last month
SkyworkAI / Skywork-Reward-V2
Scaling Preference Data Curation via Human-AI Synergy
☆94 Updated last month
efficientscaling / Z1
Repo for "Z1: Efficient Test-time Scaling with Code"
☆63 Updated 3 months ago
RUC-GSAI / YuLan-Mini
A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.
☆200 Updated last week
hzy312 / knowledge-r1
IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent
☆61 Updated 2 months ago
Open-Source-O1 / o1_Reasoning_Patterns_Study
☆103 Updated 7 months ago
zhaochenyang20 / Prompt2Model-Self-Guide
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning (COLM 2024).
☆33 Updated last year
Infini-AI-Lab / S2FT
☆19 Updated 6 months ago
SalesforceAIResearch / GemFilter
☆82 Updated 6 months ago
HKUNLP / critic-rl
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆105 Updated 2 months ago
open-compass / Ada-LEval
The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"
☆54 Updated 2 months ago