An innovative method expediting LLMs via streamlined semi-autoregressive generation and draft verification.
☆28Apr 15, 2025Updated last year
Alternatives and similar repositories for BiTA
Users that are interested in BiTA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)☆116Mar 20, 2025Updated last year
- ☆25Oct 31, 2024Updated last year
- Cascade Speculative Drafting☆33Apr 2, 2024Updated 2 years ago
- ☆13May 11, 2023Updated 2 years ago
- Efficient LLM Inference Acceleration using Prompting