Quantized LLM training in pure CUDA/C++.
☆246Jun 3, 2026Updated this week
Alternatives and similar repositories for llmq
Users that are interested in llmq are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Write a fast kernel and see how you compare against the best humans and AI on gpumode.com☆98May 8, 2026Updated last month
- My submission for the GPUMODE/AMD fp8 mm challenge☆29Jun 4, 2025Updated last year
- QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning☆185Nov 11, 2025Updated 6 months ago
- ☆21Apr 27, 2026Updated last month
- Ship correct and fast LLM kernels to PyTorch