gigit0000 / qwen3.cu
View external linksLinks

Single-file, pure CUDA C implementation for running inference on Qwen3 0.6B GGUF. No Dependencies.
22Nov 26, 2025Updated 2 months ago

Alternatives and similar repositories for qwen3.cu

Users that are interested in qwen3.cu are comparing it to the libraries listed below

Sorting:

Are these results useful?