NVIDIA curated collection of educational resources related to general purpose GPU programming.
☆1,618 · Apr 30, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for accelerated-computing-hub
Users that are interested in accelerated-computing-hub are comparing it to the libraries listed below.
- RAPIDS Deployment Documentation ☆15 · May 13, 2026 · Updated last week
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings. ☆62 · May 14, 2026 · Updated last week
- CUDA Core Compute Libraries ☆2,338 · Updated this week
- CUDA Python: Performance meets Productivity ☆3,250 · Updated this week
- NVIDIA Math Libraries for the Python Ecosystem ☆577 · May 1, 2026 · Updated 2 weeks ago
- Some CUDA example code with READMEs. ☆180 · Nov 11, 2025 · Updated 6 months ago
- No-GIL Python environment featuring NVIDIA Deep Learning libraries. ☆71 · Apr 14, 2025 · Updated last year
- CUDA Kernel Benchmarking Library ☆864 · Updated this week
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster ☆887 · Sep 26, 2025 · Updated 7 months ago
- ☆646 · May 14, 2026 · Updated last week
- The CUDA target for Numba ☆277 · Updated this week
- CUDA Templates and Python DSLs for High-Performance Linear Algebra ☆9,731 · May 13, 2026 · Updated last week
- Material for gpu-mode lectures ☆6,080 · May 9, 2026 · Updated last week
- Fast low-bit matmul kernels in Triton ☆454 · Updated this week
- GPU programming related news and material links ☆2,133 · Mar 8, 2026 · Updated 2 months ago
- An efficient C++20 GPU numerical computing library with Python-like syntax ☆1,422 · Updated this week
- ☆20 · Oct 31, 2025 · Updated 6 months ago
- NVIDIA NVSHMEM is a parallel programming interface for NVIDIA GPUs based on OpenSHMEM. NVSHMEM can significantly reduce multi-process com… ☆532 · May 5, 2026 · Updated 2 weeks ago
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS ☆256 · May 6, 2025 · Updated last year
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels ☆20 · Jul 13, 2025 · Updated 10 months ago
- Helpful kernel tutorials, examples and SKILLs for tile-based GPU programming ☆727 · Updated this week
- RAPIDS Memory Manager ☆696 · Updated this week
- CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-base… ☆958 · Apr 1, 2026 · Updated last month
- KvikIO - High Performance File IO ☆265 · Updated this week
- Pseudo-spectral code for DNS of Homogenous isotropic turbulence. Scalars and particles are also supported. ☆11 · Oct 19, 2023 · Updated 2 years ago
- Dragon distributed runtime for HPC and AI applications and workflows ☆90 · Mar 31, 2026 · Updated last month
- ☆66 · Apr 26, 2025 · Updated last year
- Offline as of 2026-03-13 ☆14 · Mar 13, 2026 · Updated 2 months ago
- ☆3,617 · Mar 11, 2026 · Updated 2 months ago
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code. ☆482 · Mar 10, 2025 · Updated last year
- Fastest kernels written from scratch ☆578 · Sep 18, 2025 · Updated 8 months ago
- A Datacenter Scale Distributed Inference Serving Framework ☆6,791 · May 14, 2026 · Updated last week
- How to call NVTX from Fortran ☆12 · Jun 25, 2025 · Updated 10 months ago
- Samples for CUDA Developers which demonstrates features in CUDA Toolkit ☆9,175 · May 13, 2026 · Updated last week
- LLM training in simple, raw C/CUDA ☆113 · May 1, 2024 · Updated 2 years ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools ☆145 · Mar 28, 2026 · Updated last month
- CUDA Library Samples ☆2,399 · May 12, 2026 · Updated last week
- A Python framework for GPU-accelerated simulation, robotics, and machine learning. ☆6,666 · Updated this week
- ☆326 · Updated this week