NEO-MLSys25 / NEOView on GitHub
NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading
94Jun 16, 2025Updated 10 months ago

Alternatives and similar repositories for NEO

Users that are interested in NEO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?