hao-lh / awesome-cuda-programming
☆18Updated 7 years ago
Related projects:
- ☆56Updated 6 years ago
- Personal collection of references for high performance mixed precision training.☆41Updated 4 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆144Updated last year
- ☆52Updated this week
- Example code to create and train a Pytorch model using the new C++ frontend.☆17Updated 5 years ago
- CUDA by practice☆110Updated 4 years ago
- Example code that appears in lectures☆23Updated 2 years ago
- Numerical optimization methods with msnhnet☆12Updated 3 years ago
- ☆56Updated this week
- CS344 - Introduction To Parallel Programming course (Udacity) proposed solutions☆49Updated 7 years ago
- Colab notebooks for d2l-book☆11Updated 4 years ago
- Deep Learning Compression and Acceleration SDK -- deep model compression for Edge and IoT embedded systems, and deep model acceleration f…☆20Updated 6 years ago
- Hands-On GPU Programming with Python and CUDA, Second Edition, published by Packt☆33Updated 3 years ago
- Implementing CNN for Digit Recognition (MNIST and SVHN dataset) using PyTorch C++ API☆24Updated 2 years ago
- A full-fledged yet minimalistic CUDA-based convolutional neural network library from scratch in C++☆15Updated 5 years ago
- A very naive and simple benchmark between dlib and pytorch in terms of space and time☆19Updated 4 years ago
- Moved to https://github.com/apache/incubator-tvm-site☆27Updated 3 years ago
- ☆42Updated 6 years ago
- ☆35Updated this week
- kmeans clustering with multi-GPU capabilities☆114Updated last year
- PyTorch 1.0 inference in C++ on Windows10 platforms☆89Updated 5 years ago
- Some CUDA programming examples☆25Updated 7 years ago
- This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transfor…☆37Updated this week
- ☆26Updated last year
- A minimalistic header only C++11 Neural Network library based on Eigen::Tensor☆20Updated 6 years ago
- CUDA C++ syntax support & snippets for VSCode☆19Updated 3 years ago
- Matrix Algebra on GPU and Multicore Architectures (MAGMA) source releases from http://icl.cs.utk.edu/magma/index.html☆20Updated 9 years ago
- This example builds on the parallel-forall repo separate compilation example by adding CMake to it.☆17Updated 6 years ago
- Slides and code for my talk at MeetingC++ 2017☆48Updated 6 years ago
- PyTorch super resolution model with RGB support and ONNX exporter☆31Updated 5 years ago