Luowaterbi / TokenRecyclingView external linksLinks
[ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling
☆22Nov 11, 2025Updated 3 months ago
Alternatives and similar repositories for TokenRecycling
Users that are interested in TokenRecycling are comparing it to the libraries listed below
Sorting:
- ☆49Aug 14, 2025Updated 6 months ago
- Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)☆367Apr 22, 2025Updated 9 months ago
- A Low-Overhead tool for Floating-Point Exception Detection in NVIDIA GPUs☆12Dec 17, 2024Updated last year
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆19Sep 24, 2025Updated 4 months ago
- Awesome LLM for Cybersecurity☆11Nov 16, 2024Updated last year
- Code for Research Project TLDR☆25Jul 28, 2025Updated 6 months ago
- ArxivDaily☆13Updated this week
- ☆12Aug 31, 2023Updated 2 years ago
- MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models☆28Updated this week
- Codes for our paper "Enhancing Continual Relation Extraction via Classifier Decomposition" (Findings of ACL2023)☆10Nov 29, 2023Updated 2 years ago
- Python Puppet Provider Abstraction for Wechaty☆13Nov 20, 2022Updated 3 years ago
- Code and data for "Medical Dialogue Generation via Dual Flow Modeling" (ACL 2023 Findings)☆13Nov 22, 2023Updated 2 years ago
- ☆13Jul 5, 2024Updated last year
- RStudio Shiny viewer for Tesla Telemetry Track Mode files☆15Mar 28, 2021Updated 4 years ago
- Code for our paper: Improved deep learning techniques in gravitational-wave data analysis.☆12Apr 16, 2021Updated 4 years ago
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". …☆19Nov 17, 2025Updated 2 months ago
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction☆15Nov 4, 2024Updated last year
- ☆11Sep 7, 2024Updated last year
- 珠算代码大模型(Abacus Code LLM)☆58Sep 26, 2024Updated last year
- ☆13Jun 7, 2022Updated 3 years ago
- PyTorch implementation of CARE☆16Oct 6, 2023Updated 2 years ago
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.☆14Mar 2, 2024Updated last year
- ☆15Apr 18, 2025Updated 9 months ago
- ☆27Dec 15, 2025Updated last month
- Android Particle System implements Particle Designer effect.☆13Mar 9, 2017Updated 8 years ago
- ☆14Oct 28, 2023Updated 2 years ago
- EMNLP2022: Learning Robust Representations for Continual Relation Extraction via Adversarial Class Augmentation☆15Oct 19, 2022Updated 3 years ago
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023)☆17Jan 23, 2024Updated 2 years ago
- Video content description model for generating descriptions for unconstrained videos☆16Jul 5, 2019Updated 6 years ago
- ☆15Apr 28, 2023Updated 2 years ago
- Code for the paper "RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection" (ACL'25).☆33Jul 23, 2025Updated 6 months ago
- Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)☆18Oct 17, 2023Updated 2 years ago
- Use genetic algorithm to optimize the backpropagation neural network.☆17Aug 21, 2020Updated 5 years ago
- Codes for Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback (ACL 2024 Findings)☆16Jul 2, 2024Updated last year
- Spacemacs configuration layer for elpy☆18Jun 14, 2015Updated 10 years ago
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆20Feb 26, 2025Updated 11 months ago
- Code for ReF Decompile: Relabeling and Function Call Enhanced Decompile☆25Dec 7, 2025Updated 2 months ago
- Code for the paper "ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation" (EMNLP'2…☆17Dec 11, 2024Updated last year
- Code repository for ICLR 2025 paper "LeanQuant: Accurate and Scalable Large Language Model Quantization with Loss-error-aware Grid"☆24Mar 2, 2025Updated 11 months ago