hao-lh / awesome-cuda-programming
☆18Updated 8 years ago
Alternatives and similar repositories for awesome-cuda-programming:
Users that are interested in awesome-cuda-programming are comparing it to the libraries listed below
- Personal collection of references for high performance mixed precision training.☆41Updated 5 years ago
- ☆56Updated 6 years ago
- Example codes appears in lectures☆23Updated 3 years ago
- CUDA by practice☆125Updated 5 years ago
- CS294-162; Machine Learning Systems Seminar☆31Updated last year
- ☆43Updated 7 years ago
- The "CUDA templates" are a collection of C++ template classes and functions which provide a consistent interface to NVIDIA's "Compute Uni…☆27Updated 13 years ago
- Resources for recent AI systems (deployment concerns, cost and accessibility). -- closed☆12Updated 3 years ago
- Colab notebooks for d2l-book☆11Updated 5 years ago
- Tensorflow Faster RCNN☆7Updated 7 years ago
- Example code to create and train a Pytorch model using the new C++ frontend.☆17Updated 6 years ago
- CUDA C++ syntax support & snippets for VSCode☆20Updated 4 years ago
- Algorithms implemented in CUDA + resources about GPGPU☆55Updated 3 years ago
- ☆20Updated 8 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU☆41Updated 6 years ago
- ☆26Updated 2 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆150Updated last year
- carefree-ml implemented Machine Learning algorithms with numpy, mainly for educational use☆34Updated 9 months ago
- NVIDIA Fleet Command is a hybrid-cloud platform for securely and remotely deploying, managing, and scaling AI across dozens or up to thou…☆12Updated 2 years ago
- Resources to work offline on the assignments of Heterogenous Parallel Programming course from Coursera.☆71Updated 5 years ago
- Hands-On GPU Programming with Python and CUDA, Second Edition, published by Packt☆36Updated 4 years ago
- A collection of awesome algorithms, implemented in CUDA.☆25Updated 7 years ago
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆77Updated last year
- A full-fledged yet minimalistic CUDA-based convolutional neural network library from scratch in C++☆15Updated 5 years ago
- Programming accelerated applications with CUDA C/C++, enough to be able to begin work accelerating your own CPU-only applications for per…☆92Updated 6 years ago
- Latte is a convolutional neural network (CNN) inference engine written in C++ and uses AVX to vectorize operations. The engine runs on Wi…☆13Updated 6 years ago
- DLPack for Tensorflow☆35Updated 4 years ago
- https://beta.mxnet.io/☆13Updated 5 years ago
- A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources☆89Updated 2 years ago
- ☆57Updated 4 years ago