jpowie01 / CUDA-DNN-MNIST

Example C++ CUDA implementation for training Neural Network on MNIST dataset

☆27

Alternatives and similar repositories for CUDA-DNN-MNIST

Users who are interested in CUDA-DNN-MNIST are comparing it to the repositories listed below.

mcarilli / mixed_precision_references
Personal collection of references for high performance mixed precision training.
☆41 Updated 5 years ago
Dynmi / AlexNet
Implement AlexNet with C / convolutional neural network / machine learning / computer vision
☆191 Updated 3 years ago
suvojit-0x55aa / mixed-precision-pytorch
Training with FP16 weights in PyTorch
☆79 Updated 5 years ago
zhenhuaw-me / tflite
Parse TFLite models (*.tflite) EASILY with Python. Check the API at https://zhenhuaw.me/tflite/docs/
☆102 Updated 5 months ago
romulus0914 / CNN_VGG19_CUDA
Convolutional neural network (VGG19 model) accelerated with CUDA
☆12 Updated 7 years ago
ZFTurbo / VGG16-Pretrained-C
Pretrained VGG16 neural net in C language
☆50 Updated 3 years ago
OpenHero / im2col
image to column
☆30 Updated 11 years ago
lzhengchun / matrix-cuda
matrix multiplication in CUDA
☆123 Updated last year
holli / yolov3_pytorch
Yolov3 (+tiny) pythonic pytorch implementation.
☆34 Updated 6 years ago
qinyao-he / bit-rnn
Quantize weights and activations in Recurrent Neural Networks.
☆94 Updated 6 years ago
masahi / tvm-winograd
Test winograd convolution written in TVM for CUDA and AMDGPU
☆41 Updated 6 years ago
mingxingtan / efficientnet
EfficientNets snapshot
☆90 Updated 6 years ago
quettabit / convolution_kernel
Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.
☆14 Updated 7 years ago
paramhanji / CUDA-CNN
Implementation of a simple CNN using CUDA
☆68 Updated 8 years ago
dlsys-course / examples
Example code that appears in lectures
☆23 Updated 3 years ago
ucla-labx / distbelief
Implementing Google's DistBelief paper
☆110 Updated 2 years ago
Deeplite / deeplite-profiler
A collection of metrics to profile a single deep learning model or compare two different deep learning models
☆26 Updated last year
bwasti / pytorch_compiler_tutorial
Codebase associated with the PyTorch compiler tutorial
☆46 Updated 5 years ago
joelgrus / autograd
coding an autograd from scratch
☆177 Updated 6 years ago
renmengye / np-conv2d
2D Convolution using NumPy
☆17 Updated 3 years ago
Xilinx / graffitist
Graph Transforms to Quantize and Retrain Deep Neural Nets in TensorFlow
☆168 Updated 5 years ago
jwfromm / Riptide
Simple Training and Deployment of Fast End-to-End Binary Networks
☆157 Updated 3 years ago
BertMoons / QuantizedNeuralNetworks-Keras-Tensorflow
Quantized Neural Networks - networks trained for inference at arbitrary low precision.
☆146 Updated 7 years ago
nmilosev / pytorch-arm-builds
Unofficial PyTorch and torchvision builds for ARM devices
☆212 Updated 3 years ago
pytorch / extension-script
Example repository for custom C++/CUDA operators for TorchScript
☆114 Updated 2 years ago
masahi / torchscript-to-tvm
☆69 Updated 2 years ago
DeGirum / pruned-models
Repository containing pruned models and related information
☆37 Updated 4 years ago
guanh01 / CS692-mlsys
This is the (evolving) reading list for the seminar.
☆59 Updated 4 years ago
ujjwal-9 / Knowledge-Distillation
Blog https://medium.com/neuralmachine/knowledge-distillation-dc241d7c2322
☆60 Updated 7 years ago
jack-willturner / pytorch-onnx-tvm
PyTorch -> ONNX -> TVM for autotuning
☆24 Updated 5 years ago