NEO-MLSys25 / NEO
NEO is an LLM inference engine built to solve the GPU memory crisis through CPU offloading.
84 | Jun 16, 2025 | Updated 8 months ago

Alternatives and similar repositories for NEO

Users who are interested in NEO are comparing it to the libraries listed below.
