infinigence / SpecEE
Repo for SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting (ISCA25)
72 · Apr 25, 2025 · Updated 11 months ago

Alternatives and similar repositories for SpecEE

Users that are interested in SpecEE are comparing it to the libraries listed below.
