FasterDecoding / MedusaView on GitHub
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
2,708Jun 25, 2024Updated last year

Alternatives and similar repositories for Medusa

Users that are interested in Medusa are comparing it to the libraries listed below

Sorting:

Are these results useful?