CalebDu / Awesome-CuteView external linksLinks
☆114May 16, 2025Updated 8 months ago
Alternatives and similar repositories for Awesome-Cute
Users that are interested in Awesome-Cute are comparing it to the libraries listed below
Sorting:
- Examples of CUDA implementations by Cutlass CuTe☆270Jul 1, 2025Updated 7 months ago
- ☆261Jul 11, 2024Updated last year
- ☆162Feb 5, 2026Updated last week
- 使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention☆78Aug 12, 2024Updated last year
- Implement Flash Attention using Cute.☆100Dec 17, 2024Updated last year
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- flash attention tutorial written in python, triton, cuda, cutlass☆486Jan 20, 2026Updated 3 weeks ago
- ☆49Apr 15, 2024Updated last year
- ☆177May 7, 2025Updated 9 months ago
- DeeperGEMM: crazy optimized version☆74May 5, 2025Updated 9 months ago
- ☆88May 31, 2025Updated 8 months ago
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆96Sep 13, 2025Updated 5 months ago
- Artifacts of EVT ASPLOS'24