FasterDecoding / Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
2,371 · Updated 6 months ago

Alternatives and similar repositories for Medusa:

Users who are interested in Medusa are comparing it to the libraries listed below.