kashif / cuda-workshopLinks
Code examples for the CUDA workshop
☆36Updated 3 years ago
Alternatives and similar repositories for cuda-workshop
Users that are interested in cuda-workshop are comparing it to the libraries listed below
Sorting:
- This is a cross-platform, CUDA-based C++ library for general-purpose, unconstrained nonlinear optimization on the GPU. It implements the …☆138Updated 5 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Updated 8 years ago
- This example builds on the parallel-forall repo separate compilation example by adding CMake to it.☆17Updated 7 years ago
- ☆44Updated 7 years ago
- This repository contains easy-to-read Python/CUDA implementations of fundamental GPU computing primitives.☆36Updated 10 years ago
- A CUDA implementation of the PageRank Pipeline Benchmark☆33Updated 8 years ago
- PLEASE SEE THE OFFICIAL REPOSITORY. THIS IS NOT MAINTAINED ANYMORE.☆93Updated 5 years ago
- GPU implementation of classical molecular dynamics proxy application.☆31Updated 8 years ago
- A shallow fork of SuiteSparse adding build files for Visual Studio and support for ACML☆102Updated 9 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆156Updated 2 years ago
- a heterogeneous multiGPU level-3 BLAS library☆46Updated 5 years ago
- CNNs in Halide☆23Updated 9 years ago
- Corrected source for the OpenCL in Action book (work in progress)☆64Updated 12 years ago
- Fork of magma to include more BLAS☆28Updated 8 years ago
- Simple example of implementing a new Tensorflow operation and its gradient in C++.☆56Updated 6 years ago
- Utilities for CUDA programming☆41Updated 6 years ago
- FastHOG library that has been fixed to work with CUDA 5.x on Ubuntu 12.04☆20Updated 11 years ago
- CUDA implementation of exclusive prefix sum via Blelloch's algorithm☆29Updated 8 years ago
- Python Binding to NVRTC☆79Updated last year
- A fork of Eigen 3.2 to use MAGMA (GPU & CPU) as backend in the same way it does with Intel MKL.☆48Updated 11 years ago
- Python Framework for sparse neural networks☆19Updated 8 years ago
- kmeans clustering with multi-GPU capabilities☆119Updated 2 years ago
- Multi-core CPU implementation of deep learning for 2D and 3D sliding window convolutional networks (ConvNets).☆94Updated 8 years ago
- Progressive Multigrid Eigensolver for Multiscale Angular Embedding Problems☆18Updated 8 years ago
- FluidNet re-written with ATen tensor lib☆52Updated 6 years ago
- Some C++ codes for computing a 1D and 2D convolution product using the FFT implemented with the GSL or FFTW☆59Updated 12 years ago
- Jonker-Volgenant / LAPJV algorithm for the linear assignment problem, in Python☆51Updated last year
- Resources to work offline on the assignments of Heterogenous Parallel Programming course from Coursera.☆72Updated 6 years ago
- A portable high-level API with CUDA or OpenCL back-end☆55Updated 8 years ago
- Deep neural network framework for multiple GPUs☆33Updated 10 years ago