hao-lh / awesome-cuda-programming
☆18Updated 8 years ago
Alternatives and similar repositories for awesome-cuda-programming
Users who are interested in awesome-cuda-programming are comparing it to the repositories listed below.
- Example code that appears in lectures☆23Updated 3 years ago
- Personal collection of references for high performance mixed precision training.☆41Updated 5 years ago
- Resources for recent AI systems (deployment concerns, cost and accessibility). -- closed☆12Updated 4 years ago
- Hands-On GPU Programming with Python and CUDA, Second Edition, published by Packt☆36Updated 4 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆154Updated 2 years ago
- ☆26Updated 2 years ago
- Build TVM docker image for production compilation deployments☆12Updated 3 years ago
- https://beta.mxnet.io/☆13Updated 5 years ago
- A curated list of awesome stuff about HPC☆25Updated 8 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU☆41Updated 6 years ago
- Runtime Tracing Library for TensorFlow☆43Updated 6 years ago
- CUDA by practice☆128Updated 5 years ago
- TensorFlow Faster RCNN☆7Updated 8 years ago
- A minimalistic header only C++11 Neural Network library based on Eigen::Tensor☆20Updated 7 years ago
- Introduction to CUDA programming☆122Updated 8 years ago
- CS-E4850 Computer Vision Course - Python assignments☆9Updated 6 years ago
- CS294-162; Machine Learning Systems Seminar☆31Updated 2 years ago
- Colab notebooks for d2l-book☆11Updated 5 years ago
- Algorithms implemented in CUDA + resources about GPGPU☆56Updated 3 years ago
- NVIDIA_Hot_Openings☆27Updated 2 years ago
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.☆38Updated last year
- ☆55Updated 6 years ago
- Deep Learning Compression and Acceleration SDK -- deep model compression for Edge and IoT embedded systems, and deep model acceleration f…☆20Updated 7 years ago
- Python bindings for NVTX☆66Updated 2 years ago
- A collection of awesome algorithms, implemented in CUDA.☆25Updated 7 years ago
- ☆44Updated 7 years ago
- Extension to connect OpenPAI clusters, submit AI jobs, simulate jobs locally, manage files, and so on.☆14Updated 2 years ago
- The Hybrid Task Graph Scheduler API☆40Updated last month
- Codebase associated with the PyTorch compiler tutorial☆46Updated 5 years ago
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆83Updated last year