hsj576/GRIFFIN

Official Implementation of "GRIFFIN: Effective Token Alignment for Faster Speculative Decoding" [NeurIPS 2025]

☆19

Alternatives and similar repositories for GRIFFIN

Users that are interested in GRIFFIN are comparing it to the libraries listed below.

HArmonizedSS / HASS
Official Implementation of "Learning Harmonized Representations for Speculative Sampling" (HASS)
☆56 · Mar 14, 2025 · Updated last year
Sike-Wang / low-bit-Shampoo
4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)
☆13 · Feb 13, 2025 · Updated last year
Linking-ai / SCOPE
(ACL2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation
☆36 · May 28, 2025 · Updated last year
hahnyuan / ASVD4LLM
Activation-aware Singular Value Decomposition for Compressing Large Language Models
☆92 · Oct 22, 2024 · Updated last year
rahulguptakota / paper-To-Reviewer-Matching-System
Paper to Reviewer Assignment is a tedious but a very crucial job for conference organizers. Till date the Toronto Paper Matching System (…
☆10 · Nov 30, 2017 · Updated 8 years ago
Hambaobao / Marathon
Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.
☆10 · May 16, 2024 · Updated 2 years ago
Pickl-3 / hitbox-fightstick-game-device
☆14 · Jul 23, 2023 · Updated 3 years ago
sail-sg / SimLayerKV
The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.
☆54 · Oct 18, 2024 · Updated last year
kyrieLei / Critic-V
☆18 · Apr 23, 2025 · Updated last year
Zanette-Labs / SpeculativeRejection
[NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection
☆56 · Oct 29, 2024 · Updated last year
mutonix / pyramidinfer
☆47 · Nov 25, 2024 · Updated last year
SAIRcompetition / equational-theories-stage1-judge
Official evaluation models and configuration for Stage 1 of the SAIR Mathematics Distillation Challenge: Equational Theories.
☆18 · Apr 19, 2026 · Updated 3 months ago
hyx1999 / SAM-Decoding
Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton
☆52 · May 12, 2026 · Updated 2 months ago
natnew / awesome-ai-scientists
A curated collection of resources for building “AI Scientist” systems: AI that assists scientific discovery through literature intelligen…
☆15 · Updated this week
BaohaoLiao / RSD
[ICML 2025] Reward-guided Speculative Decoding (RSD) for efficiency and effectiveness.
☆56 · May 2, 2025 · Updated last year
allanchen95 / TKDE-2019-CONNA
TKDE'20 paper, "CONNA: Addressing Name Disambiguation on the Fly".
☆15 · May 31, 2021 · Updated 5 years ago
soyoung97 / AcuRank
☆15 · Jul 30, 2025 · Updated 11 months ago
ztyang23 / BACON
☆20 · Jul 23, 2024 · Updated 2 years ago
lindonroberts / trust-region
Python trust-region subproblem solvers for nonlinear optimization
☆30 · Updated this week
THUDM / paper-source-trace
☆19 · Sep 29, 2024 · Updated last year
mavenlin / ai_research_trends
Trends of arxiv submissions counted from twitter/medium/reddit etc.
☆39 · Dec 11, 2022 · Updated 3 years ago
k-fujikawa / recsys-challenge-2024-1st-place
☆18 · Aug 17, 2024 · Updated last year
LiaoMengqi / E3-RL4LLMs
[EMNLP 2025 Main] Enhancing Efficiency and Exploration in Reinforcement Learning for LLMs
☆17 · Nov 7, 2025 · Updated 8 months ago
MozerWang / DEMO
[ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling
☆22 · Dec 16, 2024 · Updated last year
zfjsail / wechat-wow-analysis
Code for our TKDE paper "Understanding WeChat User Preferences and “Wow” Diffusion"
☆20 · Aug 29, 2024 · Updated last year
ZhouYuxuanYX / Hierarchical-Speculative-Decoding
Hierarchical Speculative Decoding is the SOTA verification algorithm for lossless accelerated LLM inference.
☆24 · Apr 14, 2026 · Updated 3 months ago
zjunlp / LookAheadTuning
[WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews
☆17 · Dec 14, 2025 · Updated 7 months ago
CyanScholar / CloudNativeSim
A toolkit for modeling and simulation of cloud-native applications.
☆16 · Aug 4, 2025 · Updated 11 months ago
zhenyuhe00 / BiPE
Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024
☆24 · Jun 26, 2024 · Updated 2 years ago
facebookresearch / ToMi
Code accompanying our EMNLP 2019 paper: "Revisiting the Evaluation of Theory of Mind through Question Answering"
☆29 · Aug 9, 2020 · Updated 5 years ago
maitrix-org / dynamic-alignment-optimization
[EMNLP'24 (Main)] DRPO (Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…
☆24 · Nov 17, 2024 · Updated last year
SafeAILab / EAGLE
Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).
☆2,478 · Feb 20, 2026 · Updated 5 months ago
minhaoJ2 / TaxoEnrich
The source code for self-supervised Taxonomy Completion framework TaxoEnrich, published in WWW 2022.
☆21 · Apr 25, 2022 · Updated 4 years ago
Jiuzhouh / Uncertainty-Aware-Language-Agent
This is the official repo for Towards Uncertainty-Aware Language Agent.
☆31 · Aug 15, 2024 · Updated last year
alessiodevoto / l2compress
Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."
☆19 · Dec 13, 2024 · Updated last year
IPBench / IPBench
[ACL 2026] Repository of IPBench
☆23 · Apr 6, 2026 · Updated 3 months ago
Dominic789654 / LongGenBench
Source code for the paper "LongGenBench: Long-context Generation Benchmark"
☆24 · Oct 8, 2024 · Updated last year
OpenDFM / SciEval
[AAAI 2024] SciEval: A Multi-Level Large Language Model Evaluation Benchmark for Scientific Research
☆31 · Aug 6, 2024 · Updated last year
yuezhouhu / adaspec
A selective knowledge distillation algorithm for efficient speculative decoders
☆39 · Nov 27, 2025 · Updated 7 months ago