NonvolatileMemory / GliDe_with_a_CaPE_ICML_24
View external linksLinks

official code for GliDe with a CaPE

☆20

Alternatives and similar repositories for GliDe_with_a_CaPE_ICML_24

Users that are interested in GliDe_with_a_CaPE_ICML_24 are comparing it to the libraries listed below

Sorting:

hemingkx / SWIFT
View on GitHub
[ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration
☆61Feb 21, 2025Updated 11 months ago
wangjs9 / CARE-master
View on GitHub
PyTorch implementation of CARE
☆16Oct 6, 2023Updated 2 years ago
hyx1999 / SAM-Decoding
View on GitHub
Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton
☆40Feb 13, 2025Updated last year
thunlp / FR-Spec
View on GitHub
[ACL 2025 main] FR-Spec: Frequency-Ranked Speculative Sampling
☆49Jul 15, 2025Updated 7 months ago
NonvolatileMemory / flash_tree_attn
View on GitHub
☆20Dec 24, 2024Updated last year
chqiwang / sa-nmt
View on GitHub
semi-autoregressive neural machine translation
☆23Sep 9, 2018Updated 7 years ago
antgroup / cakekv
View on GitHub
☆34Mar 17, 2025Updated 10 months ago
LiuXiaoxuanPKU / OSD
View on GitHub
☆64Dec 3, 2024Updated last year
sail-sg / LongSpec
View on GitHub
LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification
☆73Jul 14, 2025Updated 7 months ago
yandex-research / specexec
View on GitHub
☆66Nov 4, 2024Updated last year
BaiTheBest / SparseLLM
View on GitHub
Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)
☆67Mar 27, 2025Updated 10 months ago
UriSha / EmbeddinglessNMT
View on GitHub
The implementation of "Neural Machine Translation without Embeddings", NAACL 2021
☆33Jun 9, 2021Updated 4 years ago
NVlabs / RocketKV
View on GitHub
[ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression
☆32Aug 7, 2025Updated 6 months ago
hemingkx / Spec-Bench
View on GitHub
Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)
☆367Apr 22, 2025Updated 9 months ago
NJUNLP / MCSD
View on GitHub
Multi-Candidate Speculative Decoding
☆39Apr 22, 2024Updated last year
peiyunh / alpf
View on GitHub
Active Learning with Partial Feedback, ICLR 2019
☆11Apr 27, 2020Updated 5 years ago
betarixm / CSED423
View on GitHub
POSTECH: Compiler Construction (Spring 2022)
☆10Mar 10, 2023Updated 2 years ago
diku-dk / dpp-e2023-pub
View on GitHub
Repository for the DPP'23 course
☆11May 2, 2024Updated last year
rhubarbwu / neural-collapse
View on GitHub
Generic library for neural collapse and several derivative works on the phenomenon.
☆18Apr 14, 2025Updated 10 months ago
smatthewenglish / trst
View on GitHub
☆12Jan 15, 2015Updated 11 years ago
Coldmist-Lu / MQM_APE
View on GitHub
[MQM-APE] Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators.
☆11Sep 24, 2024Updated last year
Luowaterbi / TokenRecycling
View on GitHub
[ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling
☆22Nov 11, 2025Updated 3 months ago
tyshiwo1 / Accelerating-T2I-AR-with-SJD
View on GitHub
[ICLR 2025] Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding
☆48Apr 21, 2025Updated 9 months ago
lightmatter-ai / INT-FP-QSim
View on GitHub
Flexible simulator for mixed precision and format simulation of LLMs and vision transformers.
☆51Jul 10, 2023Updated 2 years ago
Infini-AI-Lab / TriForce
View on GitHub
[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
☆276Aug 31, 2024Updated last year
shanshw / LogConfigLocalizer
View on GitHub
☆16Apr 15, 2025Updated 9 months ago
cmuparlay / psac
View on GitHub
Parallel Self-Adjusting Computation
☆15Jul 5, 2021Updated 4 years ago
demelin / transformer_lexical_shortcuts
View on GitHub
Codebase accompanying the paper 'Widening the Representation Bottleneck in Neural Machine Translation with Lexical Shortcuts', (Emelin, D…
☆11Feb 14, 2023Updated 3 years ago
SunbowLiu / PTvsBT
View on GitHub
On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))
☆13Nov 21, 2021Updated 4 years ago
freesunshine0316 / sembleu
View on GitHub
SemBleu: A Robust Metric for AMR Parsing Evaluation
☆12Feb 22, 2021Updated 4 years ago
tinganchen / AlignQ
View on GitHub
[CVPR 2022] AlignQ: Alignment Quantization with ADMM-based Correlation Preservation
☆11Jan 6, 2023Updated 3 years ago
IirisSundin / active-learning-for-decision-making
View on GitHub
This repository contains the code used in a publication 'Active Learning for Decision-Making from Imbalanced Observational Data', Iiris S…
☆11May 14, 2019Updated 6 years ago
zenggo / affective-decoding-4-empathetic-dialog
View on GitHub
The code of paper Affective Decoding for Empathetic Response Generation
☆11Oct 12, 2021Updated 4 years ago
da03 / criticize_text_generation
View on GitHub
A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …
☆11Mar 18, 2023Updated 2 years ago
huqinghao / PalQuant
View on GitHub
☆12Aug 26, 2022Updated 3 years ago
Shuai-Xie / BiSeNet-wali
View on GitHub
Compress BiSeNet with Structure Knowledge Distillation for Real-time image segmentation on wali-TX2
☆11Jul 29, 2020Updated 5 years ago
ryanzhumich / sparc_atis_pytorch
View on GitHub
☆10Oct 28, 2019Updated 6 years ago
zzshou / amr-data-augmentation
View on GitHub
Code for our paper "AMR-DA: Data augmentation by abstract meaning representation" in ACL 2022
☆13May 17, 2022Updated 3 years ago
namespace-Pt / TSGen
View on GitHub
☆12Oct 28, 2024Updated last year

NonvolatileMemory / GliDe_with_a_CaPE_ICML_24View external linksLinks

Alternatives and similar repositories for GliDe_with_a_CaPE_ICML_24

NonvolatileMemory / GliDe_with_a_CaPE_ICML_24
View external linksLinks