CUDA Learning guide
☆561Jun 20, 2024Updated 2 years ago
Alternatives and similar repositories for Parallel-Computing-Cuda-C
Users that are interested in Parallel-Computing-Cuda-C are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NVIDIA tools guide☆166Jan 7, 2025Updated last year
- Read custom dataset☆12Mar 31, 2023Updated 3 years ago
- GPU programming related news and material links☆2,206Jun 15, 2026Updated 2 weeks ago
- My study notes and hands-on projects for CUDA-based GPU programming☆13Dec 11, 2025Updated 6 months ago
- Examples from Programming in Parallel with CUDA☆172Feb 5, 2026Updated 4 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Learn CUDA Programming, published by Packt☆1,259Dec 30, 2023Updated 2 years ago
- Solve puzzles. Learn CUDA.☆12,258Sep 1, 2024Updated last year
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"☆99Aug 14, 2023Updated 2 years ago
- Solve puzzles. Learn CUDA.☆62Dec 13, 2023Updated 2 years ago
- ☆3,786Mar 11, 2026Updated 3 months ago
- GPU Kernels☆226Apr 27, 2025Updated last year
- Fast low-bit matmul kernels in Triton☆475May 15, 2026Updated last month
- ☆15Feb 13, 2018Updated 8 years ago
- Implementation from scratch in CUDA C++ of image processing algorithms.☆23Oct 26, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)☆1,010Aug 19, 2024Updated last year
- Learn CUDA with PyTorch☆339Jun 1, 2026Updated 3 weeks ago
- ☆98May 30, 2026Updated last month
- Standalone commandline CLI tool for compiling Triton kernels☆20Sep 13, 2024Updated last year
- ☆14Apr 10, 2023Updated 3 years ago
- A plugin for Jupyter Notebook to run CUDA C/C++ code☆260Sep 13, 2024Updated last year
- UNet diffusion model in pure CUDA☆660Jun 28, 2024Updated 2 years ago
- Step by step implementation of a fast softmax kernel in CUDA☆68Jan 6, 2025Updated last year
- Material for gpu-mode lectures☆6,262Jun 15, 2026Updated 2 weeks ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Multi-heap-sort for many small arrays, quicksort with 3 pivots for one big array, CUDA acceleration, CUDA memory compression.☆13Sep 29, 2024Updated last year
- A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Proc…☆931Mar 29, 2025Updated last year
- ☆496Dec 18, 2025Updated 6 months ago
- High Quality Resources on GPU Programming/Architecture☆593Jul 26, 2024Updated last year
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS☆261May 6, 2025Updated last year
- Fast CUDA matrix multiplication from scratch☆1,222Sep 2, 2025Updated 9 months ago
- CUDA Library Samples☆2,446Jun 10, 2026Updated 3 weeks ago
- CUDA Templates and Python DSLs for High-Performance Linear Algebra☆9,967Updated this week
- CUDA Core Compute Libraries☆2,395Jun 24, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆92Feb 29, 2024Updated 2 years ago
- Step-by-step optimization of CUDA SGEMM☆477Mar 30, 2022Updated 4 years ago
- Samples for CUDA Developers which demonstrates features in CUDA Toolkit☆9,340May 27, 2026Updated last month
- Apply GPU in ML and DL☆68Mar 23, 2026Updated 3 months ago
- Implementations of 2D Image Convolution algorithm with CUDA (using global memory, shared memory and constant memory)☆17Jan 21, 2018Updated 8 years ago
- Learnings and programs related to CUDA☆438Jun 29, 2025Updated last year
- LLM training in simple, raw C/CUDA☆30,362Jun 26, 2025Updated last year