official code for GliDe with a CaPE
☆20Aug 13, 2024Updated last year
Alternatives and similar repositories for GliDe_with_a_CaPE_ICML_24
Users that are interested in GliDe_with_a_CaPE_ICML_24 are comparing it to the libraries listed below
Sorting:
- Official Implementation of "Learning Harmonized Representations for Speculative Sampling" (HASS)☆54Mar 14, 2025Updated 11 months ago
- PyTorch implementation of CARE☆16Oct 6, 2023Updated 2 years ago
- [ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration☆64Feb 21, 2025Updated last year
- [ACL 2025 main] FR-Spec: Frequency-Ranked Speculative Sampling☆52Jul 15, 2025Updated 7 months ago
- ☆20Dec 24, 2024Updated last year
- semi-autoregressive neural machine translation☆23Sep 9, 2018Updated 7 years ago
- ☆36Mar 17, 2025Updated 11 months ago
- ☆64Dec 3, 2024Updated last year
- LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification☆74Jul 14, 2025Updated 7 months ago
- ☆66Nov 4, 2024Updated last year
- Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)☆67Mar 27, 2025Updated 11 months ago
- ☆42Mar 28, 2024Updated last year
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression☆34Aug 7, 2025Updated 7 months ago
- Multi-Candidate Speculative Decoding☆39Apr 22, 2024Updated last year
- [ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling☆22Nov 11, 2025Updated 3 months ago
- Active Learning with Partial Feedback, ICLR 2019☆11Apr 27, 2020Updated 5 years ago
- Repository for the DPP'23 course☆11May 2, 2024Updated last year
- POSTECH: Compiler Construction (Spring 2022)☆11Mar 10, 2023Updated 2 years ago
- [MQM-APE] Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators.☆11Sep 24, 2024Updated last year
- ☆12Jan 15, 2015Updated 11 years ago
- Flexible simulator for mixed precision and format simulation of LLMs and vision transformers.☆52Jul 10, 2023Updated 2 years ago
- [COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding☆277Aug 31, 2024Updated last year
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))☆13Nov 21, 2021Updated 4 years ago
- Research Artifact For Our Submission To VLDB☆10Oct 27, 2021Updated 4 years ago
- ☆13Jan 7, 2025Updated last year
- ☆12Aug 26, 2022Updated 3 years ago
- Parallel Self-Adjusting Computation☆15Jul 5, 2021Updated 4 years ago
- ☆12Oct 28, 2024Updated last year
- SemBleu: A Robust Metric for AMR Parsing Evaluation☆12Feb 22, 2021Updated 5 years ago
- [ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation☆10May 19, 2025Updated 9 months ago
- The official implementation for the paper Improving Empathetic Dialogue Generation by Dynamically Infusing Commonsense Knowledge.☆15Aug 14, 2023Updated 2 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 2 years ago
- ☆10May 16, 2024Updated last year
- ☆17Apr 15, 2025Updated 10 months ago
- Code for our paper "AMR-DA: Data augmentation by abstract meaning representation" in ACL 2022☆13May 17, 2022Updated 3 years ago
- [CVPR 2022] AlignQ: Alignment Quantization with ADMM-based Correlation Preservation☆11Jan 6, 2023Updated 3 years ago
- Compress BiSeNet with Structure Knowledge Distillation for Real-time image segmentation on wali-TX2☆11Jul 29, 2020Updated 5 years ago
- The Python solutions of leetcode☆13Apr 26, 2020Updated 5 years ago
- Large language models to diffusion finetuning code☆24Jun 2, 2025Updated 9 months ago