NonvolatileMemory / GliDe_with_a_CaPE_ICML_24View external linksLinks
official code for GliDe with a CaPE
☆20Aug 13, 2024Updated last year
Alternatives and similar repositories for GliDe_with_a_CaPE_ICML_24
Users that are interested in GliDe_with_a_CaPE_ICML_24 are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration☆61Feb 21, 2025Updated 11 months ago
- PyTorch implementation of CARE☆16Oct 6, 2023Updated 2 years ago
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton☆40Feb 13, 2025Updated last year
- [ACL 2025 main] FR-Spec: Frequency-Ranked Speculative Sampling☆49Jul 15, 2025Updated 7 months ago
- ☆20Dec 24, 2024Updated last year
- semi-autoregressive neural machine translation☆23Sep 9, 2018Updated 7 years ago
- ☆34Mar 17, 2025Updated 10 months ago
- ☆64Dec 3, 2024Updated last year
- LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification☆73Jul 14, 2025Updated 7 months ago
- ☆66Nov 4, 2024Updated last year
- Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)☆67Mar 27, 2025Updated 10 months ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Jun 9, 2021Updated 4 years ago
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression☆32Aug 7, 2025Updated 6 months ago
- Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)☆367Apr 22, 2025Updated 9 months ago
- Multi-Candidate Speculative Decoding☆39Apr 22, 2024Updated last year
- Active Learning with Partial Feedback, ICLR 2019☆11Apr 27, 2020Updated 5 years ago
- POSTECH: Compiler Construction (Spring 2022)☆10Mar 10, 2023Updated 2 years ago
- Repository for the DPP'23 course☆11May 2, 2024Updated last year
- Generic library for neural collapse and several derivative works on the phenomenon.☆18Apr 14, 2025Updated 10 months ago
- ☆12Jan 15, 2015Updated 11 years ago
- [MQM-APE] Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators.☆11Sep 24, 2024Updated last year
- [ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling☆22Nov 11, 2025Updated 3 months ago
- [ICLR 2025] Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding☆48Apr 21, 2025Updated 9 months ago
- Flexible simulator for mixed precision and format simulation of LLMs and vision transformers.☆51Jul 10, 2023Updated 2 years ago
- [COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding☆276Aug 31, 2024Updated last year
- ☆16Apr 15, 2025Updated 9 months ago
- Parallel Self-Adjusting Computation☆15Jul 5, 2021Updated 4 years ago
- Codebase accompanying the paper 'Widening the Representation Bottleneck in Neural Machine Translation with Lexical Shortcuts', (Emelin, D…☆11Feb 14, 2023Updated 3 years ago
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))☆13Nov 21, 2021Updated 4 years ago
- SemBleu: A Robust Metric for AMR Parsing Evaluation☆12Feb 22, 2021Updated 4 years ago
- [CVPR 2022] AlignQ: Alignment Quantization with ADMM-based Correlation Preservation☆11Jan 6, 2023Updated 3 years ago
- This repository contains the code used in a publication 'Active Learning for Decision-Making from Imbalanced Observational Data', Iiris S…☆11May 14, 2019Updated 6 years ago
- The code of paper Affective Decoding for Empathetic Response Generation☆11Oct 12, 2021Updated 4 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 2 years ago
- ☆12Aug 26, 2022Updated 3 years ago
- Compress BiSeNet with Structure Knowledge Distillation for Real-time image segmentation on wali-TX2☆11Jul 29, 2020Updated 5 years ago
- ☆10Oct 28, 2019Updated 6 years ago
- Code for our paper "AMR-DA: Data augmentation by abstract meaning representation" in ACL 2022☆13May 17, 2022Updated 3 years ago
- ☆12Oct 28, 2024Updated last year