Will write CUDA for 100 days
☆39May 25, 2025Updated last year
Alternatives and similar repositories for 100-days-of-cuda
Users that are interested in 100-days-of-cuda are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple task manager in Django☆12Nov 14, 2024Updated last year
- simple grpo☆12May 28, 2025Updated last year
- 个人学习编译原理、理解创造一个编译器主体流程的小项目☆10Oct 7, 2020Updated 5 years ago
- Linux from beginner to master☆32Dec 4, 2025Updated 6 months ago
- An expression parser supporting multiple types☆21Sep 25, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆45May 4, 2025Updated last year
- Go和大语言模型编程☆44Mar 5, 2025Updated last year
- Explore training for quantized models☆26Jul 12, 2025Updated 10 months ago
- A bunch of kernels that might make stuff slower 😉☆90Updated this week
- GEMM☆10Aug 26, 2023Updated 2 years ago
- This is the project VRGK compiled for Unreal Engine 5.3 following the video of VR Experiences. https://youtu.be/YKXwHYGaCqg?si=UKcNqGvX0e…☆12Jan 5, 2024Updated 2 years ago
- ☆11May 16, 2026Updated 3 weeks ago
- My tests and experiments with some popular dl frameworks.☆17Sep 11, 2025Updated 9 months ago
- ☆48Mar 27, 2023Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- GEMV implementation with CUTLASS☆21Aug 21, 2025Updated 9 months ago
- Multi-heap-sort for many small arrays, quicksort with 3 pivots for one big array, CUDA acceleration, CUDA memory compression.☆13Sep 29, 2024Updated last year
- ☆14Nov 3, 2025Updated 7 months ago
- 《汇编语言一发入魂》配套代码☆15May 30, 2020Updated 6 years ago
- This project aims to provide a high effective KV cache manage framework for llm inference and improve memory utilization and inference sp…☆60Apr 24, 2026Updated last month
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆419Updated this week
- DoubleAI’s hyperoptimised version of cuGraph☆59Mar 3, 2026Updated 3 months ago
- ☆18Nov 22, 2025Updated 6 months ago
- 。☆13Jan 15, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 中国大学MOOC-浙 江大学-翁恺老师网课-C语言程序设计,我从零开始自学编程的记录。☆17May 18, 2020Updated 6 years ago
- ☆32Jul 2, 2025Updated 11 months ago
- 《PostgreSQL内部机制剖析(译)》适用于数据库管理员和系统开发人员☆18Jan 20, 2020Updated 6 years ago
- Fast GPU based tensor core reductions☆13Jan 13, 2023Updated 3 years ago
- portFFT is a library implementing Fast Fourier Transforms using SYCL☆19Mar 1, 2025Updated last year
- Cute layout visualization☆39Jan 18, 2026Updated 4 months ago
- Toy vector database written in c99.☆25Sep 5, 2024Updated last year
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Updated this week
- ☆13Aug 31, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- a reactor network library☆16Aug 21, 2025Updated 9 months ago
- [ICML 2025] SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity☆76Mar 10, 2026Updated 3 months ago
- Experimental GPU language with meta-programming☆31Sep 6, 2024Updated last year
- ☆15Mar 23, 2022Updated 4 years ago
- ☆68May 23, 2025Updated last year
- My submission for the GPUMODE/AMD fp8 mm challenge☆29Jun 4, 2025Updated last year
- ☆13Sep 2, 2025Updated 9 months ago