Implementation of the paper Fast Inference from Transformers via Speculative Decoding, Leviathan et al. 2023.
☆100Dec 2, 2024Updated last year
Alternatives and similar repositories for Speculative-Decoding
Users that are interested in Speculative-Decoding are comparing it to the libraries listed below
Sorting:
- ☆14Aug 19, 2024Updated last year
- minimal C implementation of speculative decoding based on llama2.c☆25Jul 15, 2024Updated last year
- Fast inference from large lauguage models via speculative decoding☆894Aug 22, 2024Updated last year
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language…☆14Dec 16, 2024Updated last year
- 📰 Must-read papers and blogs on Speculative Decoding ⚡️☆1,126Jan 24, 2026Updated last month
- DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting☆17Mar 4, 2025Updated last year
- ☆32Oct 21, 2025Updated 4 months ago
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis…☆24Jul 21, 2024Updated last year
- 北京邮电大学求职仓库--持续更新☆21Sep 5, 2025Updated 6 months ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- [CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models☆101Nov 22, 2025Updated 3 months ago
- AI-native knowledge kernel for human/agent collaboration. Use it as a Knowledge Base, Wiki, Annotator, Research Tool, or Agentic Memory.☆29Updated this week
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 3 months ago
- Software to enable data-rich collaboration from high-resolution display walls to your laptop☆16Feb 19, 2026Updated last week
- Luthier, a GPU binary instrumentation tool for AMD GPUs☆27Feb 21, 2026Updated last week
- Rhetorical sentence classification using LLMs☆11Oct 26, 2025Updated 4 months ago
- Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)☆369Apr 22, 2025Updated 10 months ago
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"☆39Mar 4, 2024Updated 2 years ago
- Fast, free, easy, and object-agnostic video anonymization☆11Dec 12, 2020Updated 5 years ago
- ☆31Feb 3, 2026Updated last month
- A widget for swiping through a deck of cards with gestures or buttons.☆13Sep 9, 2023Updated 2 years ago
- Neural Network Execution Service☆11Oct 3, 2023Updated 2 years ago
- ☆24Dec 19, 2025Updated 2 months ago
- Auction Theory Toolbox – Computer Verified Auctions☆14Jul 12, 2016Updated 9 years ago
- Free Pomodoro Timer For Android, Mac, Windows, IOS☆16Dec 20, 2025Updated 2 months ago
- As a Pangolin looks for bugs and catches them, the goal of this library is ot help developers finding bugs in their neural networks and n…☆13May 18, 2024Updated last year
- ☆13Oct 21, 2024Updated last year
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated 2 months ago
- Official Implementation of DART (DART: Diffusion-Inspired Speculative Decoding for Fast LLM Inference).☆44Feb 8, 2026Updated 3 weeks ago
- MCP server for Grok AI API integration☆21Jun 2, 2025Updated 9 months ago
- ☆15Dec 9, 2025Updated 2 months ago
- A UI designer for constructing AI applications with OpenSearch☆16Updated this week
- From-Classification-to-Clinical☆12Apr 26, 2024Updated last year
- Demo showing how to use Entra ID with MCP servers without passing access tokens through.☆14Apr 4, 2025Updated 11 months ago
- ☆14Dec 27, 2024Updated last year
- [ICCV 2021] Multimodal Knowledge Expansion☆10Aug 28, 2021Updated 4 years ago
- ☆14Jul 18, 2025Updated 7 months ago
- A model context protocol implementation granting LLMs access to make database queries and learn about supabase types.☆14Dec 13, 2024Updated last year
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆14Nov 25, 2024Updated last year