High-efficiency LLM inference engine in C++/CUDA. Run Llama 70B on RTX 3090.
★461 · Feb 22, 2026 · Updated 2 months ago
Alternatives and similar repositories for ntransformer
Users that are interested in ntransformer are comparing it to the libraries listed below.
- Rust implementation of the Zstandard Seekable Format ★265 · Apr 17, 2026 · Updated 2 weeks ago
- Experiments with the Mojo 🔥 programming language on macOS arm64 guided by tests ★13 · Jan 8, 2026 · Updated 3 months ago
- DeepDream for video with temporal consistency. Features RAFT optical flow estimation and occlusion masking to prevent ghosting. A PyTorch…