eliben / cs344Links
Introduction to Parallel Programming class code
☆30Updated 10 years ago
Alternatives and similar repositories for cs344
Users that are interested in cs344 are comparing it to the libraries listed below
Sorting:
- Symbolic differentiation engine for optimization-based machine learning models.☆43Updated 8 years ago
- C++ 11 implementation of Geoff Hinton's Deep Learning matlab code☆286Updated 10 years ago
- Python wrappers for the NVIDIA cuDNN libraries☆142Updated 8 years ago
- A collection of resources on modern C++☆47Updated 2 years ago
- Papers and blogs related to distributed deep learning☆96Updated 7 years ago
- My fork of Alex Krizhevsky's cuda-convnet from 2013 where I added dropout, among other features.☆259Updated 10 years ago
- Benchmarking matrix multiplication implementations☆102Updated 9 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆137Updated 8 years ago
- A CUDNN minimal deep learning training code sample using LeNet.☆269Updated 2 years ago
- A lightweight and user friendly C++ library for deep and convolutional neural network with GPU acceleration☆347Updated 9 years ago
- neon tutorials☆93Updated 2 years ago
- Parallel Algorithm Scheduling Library☆107Updated 8 years ago
- Convolutional neural networks C++ framework with CPU and GPU (CUDA) backends☆182Updated 6 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆298Updated 6 years ago
- Randomized Decision Trees: A Fast C++ Implementation of Random Forests.☆179Updated 5 years ago
- My solutions to Udacity's Parallel Programming course (CS 344)☆75Updated 8 years ago
- Fork of magma to include more BLAS☆28Updated 8 years ago
- This project is a simple deep neural network trained using only TensorFlow C++.☆117Updated 2 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆64Updated 6 years ago
- A machine vision library written in SYCL and C++ that shows performance-portable implementation of graph algorithms☆162Updated last year
- Proof-of-Concept CNN in Halide☆22Updated 9 years ago
- tutorial to optimize GEMM performance on android☆51Updated 9 years ago
- A Light-weight and Fast Template Matrix Library☆134Updated 12 years ago
- Benchmarking State-of-the-Art Deep Learning Software Tools☆171Updated 8 years ago
- CMake Examples (CMake, CMake+CUDA, CMake+CUDA+PandaRoot)☆42Updated 12 years ago
- Resources to work offline on the assignments of Heterogenous Parallel Programming course from Coursera.☆72Updated 6 years ago
- CMSC 12300 - Computer Science with Applications 3☆75Updated 8 years ago
- The "CUDA templates" are a collection of C++ template classes and functions which provide a consistent interface to NVIDIA's "Compute Uni…☆27Updated 14 years ago
- mojo cnn: c++ convolutional neural network☆198Updated 7 years ago
- A portable high-level API with CUDA or OpenCL back-end☆55Updated 8 years ago