kashif / cuda-workshop
Code examples for the CUDA workshop
☆36Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for cuda-workshop
- Python Framework for sparse neural networks☆19Updated 7 years ago
- Example code used in the CVPR 2015 tutorial☆39Updated 9 years ago
- This example builds on the parallel-forall repo separate compilation example by adding CMake to it.☆17Updated 7 years ago
- The "CUDA templates" are a collection of C++ template classes and functions which provide a consistent interface to NVIDIA's "Compute Uni…☆27Updated 13 years ago
- Massively Scalable Clustering☆23Updated 5 years ago
- Introduction to CUDA programming☆113Updated 7 years ago
- Utilities for CUDA programming☆39Updated 5 years ago
- FluidNet re-written with ATen tensor lib☆51Updated 5 years ago
- PLEASE SEE THE OFFICIAL REPOSITORY. THIS IS NOT MAINTAINED ANYMORE.☆93Updated 4 years ago
- Progressive Multigrid Eigensolver for Multiscale Angular Embedding Problems☆18Updated 8 years ago
- GPU implementation of classical molecular dynamics proxy application.☆30Updated 7 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Updated 7 years ago
- Contains sources related to the lectures and labs for the NVIDIA OpenACC course.☆52Updated 5 years ago
- Simple example of implementing a new Tensorflow operation and its gradient in C++.☆56Updated 5 years ago
- Corrected source for the OpenCL in Action book (work in progress)☆61Updated 11 years ago
- Code for "Message Scheduling for Performant, Many-Core Belief Propagation"☆10Updated 5 years ago
- Cooperative Primitives for CUDA C++ Kernel Authors. This repository contains CUB PRs from Q4 2019 until Q4 2020.☆22Updated 4 years ago
- Optimized half precision gemm assembly kernels (deprecated due to ROCm)☆47Updated 7 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU☆40Updated 6 years ago
- A fork of Eigen 3.2 to use MAGMA (GPU & CPU) as backend in the same way it does with Intel MKL.☆48Updated 10 years ago
- Python Binding to NVRTC☆79Updated last month
- FastHOG library that has been fixed to work with CUDA 5.x on Ubuntu 12.04☆19Updated 10 years ago
- ☆42Updated 6 years ago
- Introduction to OpenACC☆27Updated 3 years ago
- CNNs in Halide☆23Updated 9 years ago
- This repository contains easy-to-read Python/CUDA implementations of fundamental GPU computing primitives.☆36Updated 9 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆146Updated last year
- ☆102Updated 5 years ago