xaskasdf / ntransformer
High-efficiency LLM inference engine in C++/CUDA. Run Llama 70B on RTX 3090.
446 · Feb 22, 2026 · Updated last month

Alternatives and similar repositories for ntransformer

Users who are interested in ntransformer are comparing it to the libraries listed below.
