kashif / cuda-workshopLinks
Code examples for the CUDA workshop
☆36Updated 3 years ago
Alternatives and similar repositories for cuda-workshop
Users that are interested in cuda-workshop are comparing it to the libraries listed below
Sorting:
- This is a cross-platform, CUDA-based C++ library for general-purpose, unconstrained nonlinear optimization on the GPU. It implements the …☆138Updated 5 years ago
- Corrected source for the OpenCL in Action book (work in progress)☆64Updated 12 years ago
- Utilities for CUDA programming☆41Updated 6 years ago
- ☆43Updated 7 years ago
- kmeans clustering with multi-GPU capabilities☆120Updated 2 years ago
- This repository contains easy-to-read Python/CUDA implementations of fundamental GPU computing primitives.☆36Updated 10 years ago
- Introduction to CUDA programming☆129Updated 8 years ago
- Simple example of implementing a new Tensorflow operation and its gradient in C++.☆56Updated 6 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆137Updated 8 years ago
- This example builds on the parallel-forall repo separate compilation example by adding CMake to it.☆17Updated 8 years ago
- Example code used in the CVPR 2015 tutorial☆42Updated 10 years ago
- Example of how to use CUDA with CMake >= 3.8☆70Updated 6 months ago
- a heterogeneous multiGPU level-3 BLAS library☆46Updated 6 years ago
- PLEASE SEE THE OFFICIAL REPOSITORY. THIS IS NOT MAINTAINED ANYMORE.☆93Updated 5 years ago
- kmeans☆55Updated 9 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆21Updated 8 years ago
- Resources to work offline on the assignments of Heterogenous Parallel Programming course from Coursera.☆72Updated 6 years ago
- FastHOG library that has been fixed to work with CUDA 5.x on Ubuntu 12.04☆20Updated 11 years ago
- Python Framework for sparse neural networks☆19Updated 8 years ago
- ☆101Updated 6 years ago
- A shallow fork of SuiteSparse adding build files for Visual Studio and support for ACML☆102Updated 10 years ago
- Python Binding to NVRTC☆79Updated last year
- Symbolic differentiation engine for optimization-based machine learning models.☆43Updated 8 years ago
- A CUDA implementation of the PageRank Pipeline Benchmark☆34Updated 8 years ago
- Parallel network flows using OpenMP and CUDA.☆28Updated 7 years ago
- GPU implementation of classical molecular dynamics proxy application.☆31Updated 8 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆157Updated 2 years ago
- A GPU (CUDA) based Artificial Neural Network library☆110Updated 4 years ago
- CUDA implementation of exclusive prefix sum via Blelloch's algorithm☆29Updated 8 years ago
- Source code from NVIDIA CUDACasts☆48Updated 11 years ago