FasterDecoding / MedusaLinks

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
2,530Updated 11 months ago

Alternatives and similar repositories for Medusa

Users that are interested in Medusa are comparing it to the libraries listed below

Sorting: