NEO-MLSys25 / NEOView on GitHub
NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading
90Jun 16, 2025Updated 9 months ago

Alternatives and similar repositories for NEO

Users that are interested in NEO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?