FasterDecoding / MedusaView on GitHub
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
2,720Jun 25, 2024Updated last year

Alternatives and similar repositories for Medusa

Users that are interested in Medusa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?