☆52Feb 19, 2024Updated 2 years ago
Alternatives and similar repositories for Hydra
Users that are interested in Hydra are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] The official implementation of "Kangaroo: Lossless Self-Speculative Decoding for Accelerating LLMs via Double Early Exitin…☆66Jun 26, 2024Updated last year
- Official Implementation of "Learning Harmonized Representations for Speculative Sampling" (HASS)☆54Mar 14, 2025Updated 11 months ago
- Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)☆369Apr 22, 2025Updated 10 months ago
- ☆19Jul 31, 2025Updated 7 months ago
- Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**☆216Feb 13, 2025Updated last year
- ☆10Feb 1, 2022Updated 4 years ago
- Official codes for Scalable Infomin Learning, NeurIPS 2022☆13Feb 28, 2023Updated 3 years ago
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16May 4, 2023Updated 2 years ago
- ☆14Jun 4, 2024Updated last year
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads☆2,710Jun 25, 2024Updated last year
- Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).☆2,201Feb 20, 2026Updated last week
- Implementation of self-certainty as an extention of ZeroEval Project☆34May 31, 2025Updated 9 months ago
- LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification☆74Jul 14, 2025Updated 7 months ago
- Fast inference from large lauguage models via speculative decoding☆894Aug 22, 2024Updated last year
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆35Jun 12, 2024Updated last year
- [COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding☆277Aug 31, 2024Updated last year
- The Scala programming language☆16Mar 31, 2023Updated 2 years ago
- [ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding☆1,315Mar 6, 2025Updated 11 months ago
- scalable and robust tree-based speculative decoding algorithm☆370Jan 28, 2025Updated last year
- 📰 Must-read papers and blogs on Speculative Decoding ⚡️☆1,126Jan 24, 2026Updated last month
- Documentation at☆14Mar 27, 2025Updated 11 months ago
- A salesforce library designed to provide idiomatic clojure representations of salesforce data and metadata☆11Jan 14, 2020Updated 6 years ago
- ☆11Jan 25, 2019Updated 7 years ago
- Official AYON<->Kitsu intetgration (WIP)☆12Jan 27, 2026Updated last month
- DELTA-pytorch:DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation☆12Apr 16, 2024Updated last year
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- ☆21Aug 8, 2025Updated 6 months ago
- ☆11Jun 14, 2015Updated 10 years ago
- CodeFeedr core infrastructure☆10Nov 10, 2020Updated 5 years ago
- [ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling☆22Nov 11, 2025Updated 3 months ago
- Schema-aware JSON compression with millisecond lookups — cut transfer/storage while enabling exists*/pos* queries. (Demo + wheels; core i…☆24Feb 21, 2026Updated last week
- Allows you to save images with their generation metadata in ComfyUI. Compatible with Civitai. Works with png, jpeg and webp.☆11Jan 14, 2024Updated 2 years ago
- open source taxi dispatch software 出行加打车软件UI设计效果图☆14Dec 22, 2020Updated 5 years ago
- ☆10Sep 10, 2023Updated 2 years ago
- Sandbox that demonstrates derivation of camera Log to Linear conversions, and an ACES IDT and ODT for Z-Log 2.☆10Nov 14, 2021Updated 4 years ago
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts☆40Feb 29, 2024Updated 2 years ago
- [ICML 2021] This is the official github repo for training L_inf dist nets with high certified accuracy.☆42Mar 16, 2022Updated 3 years ago
- Evaluating AlexNet features at various depths☆40Oct 13, 2020Updated 5 years ago
- ☆10Feb 25, 2026Updated last week