thu-nics / R2RLinks
The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing"
☆35Updated last week
Alternatives and similar repositories for R2R
Users that are interested in R2R are comparing it to the libraries listed below
Sorting:
- ☆84Updated last month
- [ICML24] Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for LLMs☆84Updated 7 months ago
- ☆51Updated 3 months ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆24Updated 6 months ago
- VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆47Updated 3 weeks ago
- ☆37Updated last month
- [ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…☆49Updated 3 months ago
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆31Updated last year
- [ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"☆108Updated last month
- ☆58Updated this week
- ☆45Updated last week
- A WebUI for Side-by-Side Comparison of Media (Images/Videos) Across Multiple Folders☆24Updated 4 months ago
- [ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization☆71Updated 3 weeks ago
- GIFT: Generative Interpretable Fine-Tuning☆20Updated 8 months ago
- ☆17Updated 5 months ago
- [ICLR 2025] Mixture Compressor for Mixture-of-Experts LLMs Gains More☆46Updated 4 months ago
- Open-Pandora: On-the-fly Control Video Generation☆34Updated 6 months ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆76Updated 6 months ago
- The official repo of continuous speculative decoding☆27Updated 2 months ago
- ☆152Updated last week
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆38Updated last month
- ✈️ Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints☆69Updated 2 months ago
- Official Implementation of FastKV: KV Cache Compression for Fast Long-Context Processing with Token-Selective Propagation☆20Updated last month
- ☆42Updated 7 months ago
- The code repository of "MBQ: Modality-Balanced Quantization for Large Vision-Language Models"☆40Updated 3 months ago
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better☆15Updated 4 months ago
- Triton implement of bi-directional (non-causal) linear attention☆50Updated 4 months ago
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"☆20Updated last month