hao-lh / awesome-cuda-programming
☆18Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for awesome-cuda-programming
- Some CUDA design patterns and a bit of template magic for CUDA☆146Updated last year
- Hands-On GPU Programming with Python and CUDA, Second Edition, published by Packt☆33Updated 3 years ago
- A full-fledged yet minimalistic CUDA-based convolutional neural network library from scratch in C++☆15Updated 5 years ago
- CUDA by practice☆116Updated 4 years ago
- A minimalistic header only C++11 Neural Network library based on Eigen::Tensor☆20Updated 6 years ago
- Parallel network flows using OpenMP and CUDA.☆27Updated 6 years ago
- A collection of awesome algorithms, implemented in CUDA.☆24Updated 6 years ago
- Example of how to use CUDA with CMake >= 3.8☆69Updated last year
- Example code to create and train a Pytorch model using the new C++ frontend.☆17Updated 5 years ago
- ResNet Implementation, Training, and Inference Using LibTorch C++ API☆35Updated 5 months ago
- numerical optimizaiton methods with msnhnet☆12Updated 3 years ago
- Introduction to CUDA programming☆113Updated 7 years ago
- My curated list of C++ (GPU) BLAS libraries and machine learning/reinforcement learning frameworks☆23Updated 4 years ago
- ☆19Updated 8 years ago
- some CUDA programming example☆25Updated 7 years ago
- Algorithms implemented in CUDA + resources about GPGPU☆54Updated 2 years ago
- ☆42Updated 6 years ago
- Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )☆57Updated last week
- Deep Learning Compression and Acceleration SDK -- deep model compression for Edge and IoT embedded systems, and deep model acceleration f…☆20Updated 6 years ago
- Collective Knowledge repository for NVIDIA's TensorRT☆37Updated 3 years ago
- Programming accelerated applications with CUDA C/C++, enough to be able to begin work accelerating your own CPU-only applications for per…☆92Updated 6 years ago
- An expression template based linear algebra library running completely on the GPU using CUDA☆22Updated 3 years ago
- Tensorflow Faster RCNN☆7Updated 7 years ago
- Example repository for custom C++/CUDA operators for TorchScript☆114Updated 2 years ago
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆68Updated last year
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆83Updated 9 months ago
- Implementing CNN for Digit Recognition (MNIST and SVHN dataset) using PyTorch C++ API☆24Updated 2 years ago
- A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources☆81Updated last year