Repository of the paper "Accelerating Transformer Inference for Translation via Parallel Decoding"
☆124 · Mar 15, 2024 · Updated 2 years ago
Alternatives and similar repositories for parallel-decoding
Users that are interested in parallel-decoding are comparing it to the libraries listed below.
- A PyTorch-based neural implicit geometry toolbox. ☆16 · Jul 25, 2022 · Updated 3 years ago
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span … ☆14 · Aug 25, 2023 · Updated 2 years ago
- [ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding ☆1,333 · Mar 6, 2025 · Updated last year
- contrastive decoding ☆206 · Nov 14, 2022 · Updated 3 years ago
- ☆13 · Aug 23, 2024 · Updated last year
- REST: Retrieval-Based Speculative Decoding, NAACL 2024 ☆218 · Mar 5, 2026 · Updated last month
- Parallel Associative Scan for Language Models ☆18 · Jan 8, 2024 · Updated 2 years ago
- This repository contains the official code of the research paper Study on transfer learning capabilities for pneumonia classification in … ☆10 · May 13, 2022 · Updated 3 years ago
- Teaching material for the course of Deep Learning and Applied AI, 2nd semester 2020, Sapienza University of Rome ☆34 · Aug 24, 2020 · Updated 5 years ago
- ☆11 · Oct 12, 2023 · Updated 2 years ago
- Deep Learning & Applied AI: Tutorials ☆14 · Jul 5, 2020 · Updated 5 years ago
- A Word Level Transformer layer based on PyTorch and 🤗 Transformers. ☆34 · Jan 31, 2024 · Updated 2 years ago
- NLP Preprocessing Pipeline Wrappers ☆11 · May 12, 2023 · Updated 2 years ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation ☆27 · Jun 7, 2024 · Updated last year
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads ☆2,727 · Jun 25, 2024 · Updated last year