jpowie01 / CUDA-DNN-MNIST
Example C++ CUDA implementation for training Neural Network on MNIST dataset
☆26Updated 6 years ago
Alternatives and similar repositories for CUDA-DNN-MNIST:
Users that are interested in CUDA-DNN-MNIST are comparing it to the libraries listed below
- Tutorials on Quantized Neural Network using Tensorflow Lite☆86Updated 5 years ago
- Personal collection of references for high performance mixed precision training.☆41Updated 5 years ago
- ☆53Updated 7 years ago
- This is a c++ implementation of an LSTM Neural Network parallelized for a GPU using CUDA☆23Updated 7 years ago
- Graph Transforms to Quantize and Retrain Deep Neural Nets in TensorFlow☆168Updated 5 years ago
- Pretrained VGG16 neural net in C language☆49Updated 3 years ago
- image to column☆30Updated 10 years ago
- Parse TFLite models (*.tflite) EASILY with Python. Check the API at https://zhenhuaw.me/tflite/docs/☆98Updated 3 weeks ago
- Implementing CNN code in CUDA and OpenCL to evaluate its performance on NVIDIA GPUs, AMD GPUs, and an FPGA platform.☆54Updated 7 years ago
- Convolutional Neural Network of vgg19 model using Cuda to accelerate☆12Updated 6 years ago
- Training with FP16 weights in PyTorch☆77Updated 5 years ago
- All about acceleration and compression of Deep Neural Networks☆33Updated 5 years ago
- Compress neural network with pruning and quantization using TensorFlow.☆105Updated 6 years ago
- Repository containing pruned models and related information☆37Updated 3 years ago
- cuDNN sample codes provided by Nvidia☆45Updated 6 years ago
- Quantized Neural Networks - networks trained for inference at arbitrary low precision.☆146Updated 7 years ago
- PyProf2: PyTorch Profiling tool☆82Updated 4 years ago
- A lightweight deep learning framework made with ❤️☆32Updated 5 years ago
- Post-training static quantization using ResNet18 architecture☆37Updated 4 years ago
- This is a PyTorch implementation of the Scalpel. Node pruning for five benchmark networks and SIMD-aware weight pruning for LeNet-300-100…☆41Updated 6 years ago
- Course Webpage for CS 217 Hardware Accelerators for Machine Learning, Stanford University☆96Updated last year
- Fast CUDA Kernels for ResNet Inference.☆171Updated 5 years ago
- This repository contains the results and code for the MLPerf™ Inference v0.5 benchmark.☆55Updated last year
- ☆58Updated 2 years ago
- Implementation of convolution layer in different flavors☆68Updated 7 years ago
- CUDA for MNIST training/inference☆37Updated last year
- PyTorch -> ONNX -> TVM for autotuning☆23Updated 4 years ago
- This is a collection of works on neural networks and neural accelerators.☆40Updated 5 years ago
- Modified version of PyTorch able to work with changes to GPGPU-Sim☆48Updated 2 years ago
- This repository represents training examples for the CVPR 2018 paper "SYQ:Learning Symmetric Quantization For Efficient Deep Neural Netwo…☆31Updated 5 years ago