hemingkx / SWIFT

[ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration
42Updated last month

Alternatives and similar repositories for SWIFT:

Users that are interested in SWIFT are comparing it to the libraries listed below