dusty-nv / NanoLLM

Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.
239Updated 4 months ago

Alternatives and similar repositories for NanoLLM:

Users that are interested in NanoLLM are comparing it to the libraries listed below