gigit0000 / qwen3.cuLinks

Single-file, pure CUDA C implementation for running inference on Qwen3 0.6B GGUF. No Dependencies.
22Updated last month

Alternatives and similar repositories for qwen3.cu

Users that are interested in qwen3.cu are comparing it to the libraries listed below

Sorting: