eliben / cs344Links
Introduction to Parallel Programming class code
☆30Updated 10 years ago
Alternatives and similar repositories for cs344
Users that are interested in cs344 are comparing it to the libraries listed below
Sorting:
- Symbolic differentiation engine for optimization-based machine learning models.☆43Updated 7 years ago
- GPU Automatically Tuned Linear Algebra Software☆28Updated 9 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆64Updated 5 years ago
- The "CUDA templates" are a collection of C++ template classes and functions which provide a consistent interface to NVIDIA's "Compute Uni…☆27Updated 13 years ago
- Parallel Algorithm Scheduling Library☆106Updated 7 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 8 years ago
- Fork of magma to include more BLAS☆28Updated 8 years ago
- C++ implementation of concurrent Binary Search Trees☆72Updated 9 years ago
- Fast matrix multiplication☆29Updated 3 years ago
- Deep neural network framework (C/C++/CUDA).☆31Updated 9 years ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 5 years ago
- Simple example of implementing a new Tensorflow operation and its gradient in C++.☆56Updated 6 years ago
- Scientific library for high-precision computations and research☆49Updated 7 years ago
- A Light-weight and Fast Template Matrix Library☆132Updated 12 years ago
- Windows Visual Studio Solutions for class "Introduction to Parallel Programming"☆19Updated 6 years ago
- Slides and code for my talk at MeetingC++ 2017☆48Updated 7 years ago
- Resources to work offline on the assignments of Heterogenous Parallel Programming course from Coursera.☆72Updated 5 years ago
- neon tutorials☆93Updated 2 years ago
- Convolutional neural networks C++ framework with CPU and GPU (CUDA) backends☆178Updated 6 years ago
- C++ library [machine learning & numerical optimization] - superseeded by libnano☆1Updated 6 years ago
- tutorial to optimize GEMM performance on android☆51Updated 9 years ago
- My solutions to Udacity's Parallel Programming course (CS 344)☆75Updated 8 years ago
- source code for the book "C++ Concurrency in Action"☆28Updated 8 years ago
- (Spring 2017) Assignment 2: GPU Executor☆62Updated 8 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Updated 7 years ago
- Optimized half precision gemm assembly kernels (deprecated due to ROCm)☆47Updated 8 years ago
- FastHOG library that has been fixed to work with CUDA 5.x on Ubuntu 12.04☆20Updated 11 years ago
- Training a Tensorflow graph in C++☆25Updated 8 years ago
- A minimalistic header only C++11 Neural Network library based on Eigen::Tensor☆20Updated 7 years ago