hemingkx / SWIFT

SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration
31Updated last month

Alternatives and similar repositories for SWIFT:

Users that are interested in SWIFT are comparing it to the libraries listed below