hao-lh / awesome-cuda-programming
☆18 Updated 8 years ago
Alternatives and similar repositories for awesome-cuda-programming:
Users who are interested in awesome-cuda-programming are comparing it to the repositories listed below.
- Personal collection of references for high performance mixed precision training. ☆41 Updated 5 years ago
- CUDA C++ syntax support & snippets for VSCode ☆20 Updated 3 years ago
- A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources ☆85 Updated last year
- ☆26 Updated 2 years ago
- Colab notebooks for d2l-book ☆11 Updated 5 years ago
- ☆56 Updated 6 years ago
- https://beta.mxnet.io/ ☆13 Updated 5 years ago
- Heterogeneous Run Time version of TensorFlow. Added heterogeneous capabilities to the TensorFlow, uses heterogeneous computing infrastruc… ☆36 Updated 6 years ago
- A full-fledged yet minimalistic CUDA-based convolutional neural network library from scratch in C++ ☆15 Updated 5 years ago
- ☆9 Updated 3 months ago
- Example codes appears in lectures ☆23 Updated 3 years ago
- Hands-On GPU Programming with Python and CUDA, Second Edition, published by Packt ☆36 Updated 3 years ago
- A minimalistic header only C++11 Neural Network library based on Eigen::Tensor ☆20 Updated 7 years ago
- Some CUDA design patterns and a bit of template magic for CUDA ☆148 Updated last year
- Tensorflow Faster RCNN ☆7 Updated 7 years ago
- Deep neural network framework for multiple GPUs ☆33 Updated 9 years ago
- A collection of awesome algorithms, implemented in CUDA. ☆24 Updated 6 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU ☆40 Updated 6 years ago
- A lightweight deep learning framework made with ❤️ ☆32 Updated 5 years ago
- Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc.) ☆58 Updated last month
- ☆30 Updated 4 years ago
- This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transfor… ☆57 Updated this week
- Codebase associated with the PyTorch compiler tutorial ☆44 Updated 5 years ago
- Contains sources related to the lectures and labs for the NVIDIA OpenACC course. ☆51 Updated 5 years ago
- CS344 - Introduction To Parallel Programming course (Udacity) proposed solutions ☆52 Updated 7 years ago
- Move to https://github.com/apache/incubator-tvm-site ☆27 Updated 4 years ago
- Example code used in the CVPR 2015 tutorial ☆39 Updated 9 years ago
- ☆22 Updated 9 years ago
- A disciplined approach to neural network parameters - Reviewing the approach for setting Hyper parameters by Leslie Smith ☆11 Updated 6 years ago