hemingkx / SWIFT

[ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration
37Updated 2 months ago

Alternatives and similar repositories for SWIFT:

Users that are interested in SWIFT are comparing it to the libraries listed below