parallel-ml / asplos2018-workshop

ReQuEST 2018 workshop: Real-Time Image Recognition Using Collaborative IoT Devices

☆9

Alternatives and similar repositories for asplos2018-workshop

Users who are interested in asplos2018-workshop are comparing it to the repositories listed below.

TalwalkarLab / paleo
An analytical performance modeling tool for deep neural networks.
☆89 Updated 4 years ago
hyln9 / GCNGEMM
Optimized half precision gemm assembly kernels (deprecated due to ROCm)
☆47 Updated 8 years ago
masahi / tvm-winograd
Test winograd convolution written in TVM for CUDA and AMDGPU
☆41 Updated 6 years ago
mlcommons / training_results_v0.5
This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.
☆35 Updated 3 months ago
JC1DA / DeepMon
☆37 Updated 7 years ago
mlcommons / inference_results_v0.5
This repository contains the results and code for the MLPerf™ Inference v0.5 benchmark.
☆55 Updated 2 weeks ago
Emma926 / paradnn
ParaDnn: A systematic performance analysis methodology for deep learning.
☆39 Updated 5 years ago
ctuning / ck-request-asplos18-mobilenets-tvm-arm
CK workflow, portable packages and other artifacts for the ReQuEST-ASPLOS'18 submission:
☆11 Updated 6 years ago
andersy005 / tvm-in-action
TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together
☆64 Updated 7 years ago
IntelLabs / SkimCaffe
Caffe for Sparse Convolutional Neural Network
☆238 Updated 2 years ago
ctuning / ck-mlperf
This repository is outdated! Join the open MLPerf workgroup to participate in the development of the next generation of automation workfl…
☆31 Updated 2 years ago
tbd-ai / tbd-suite
☆47 Updated 2 years ago
dmlc / nnvm-fusion
Kernel Fusion and Runtime Compilation Based on NNVM
☆70 Updated 8 years ago
ravi-teja-mullapudi / Halide-NN
CNNs in Halide
☆23 Updated 9 years ago
rdadolf / fathom
Reference workloads for modern deep learning methods.
☆73 Updated 2 years ago
CSshengxy / MEC
ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network, C++ implementation (unofficial)
☆17 Updated 6 years ago
kunglab / branchynet
☆131 Updated last year
naibaf7 / libdnn
Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL
☆136 Updated 8 years ago
MatthieuCourbariaux / deep-learning-multipliers
Training deep neural networks with low precision multiplications
☆63 Updated 10 years ago
zhuwenxi / pytorch-profiling-tool
☆54 Updated 7 years ago
zhiqi-0 / RDMA-MXNet-ps-lite
RDMA Optimization on MXNet
☆14 Updated 7 years ago
czhu95 / ternarynet
Implementation for Trained Ternary Network.
☆108 Updated 8 years ago
xingyul / sparse-winograd-cnn
Efficient Sparse-Winograd Convolutional Neural Networks (ICLR 2018)
☆191 Updated 6 years ago
deep500 / deep500
A Deep Learning Meta-Framework and HPC Benchmarking Library
☆81 Updated 3 years ago
XiuYuLi / flexible-gemm
flexible-gemm conv of deepcore
☆17 Updated 5 years ago
hcho3 / relayviz
Visualize TVM Relay program graph
☆12 Updated 5 years ago
ppwwyyxx / haDNN
Proof-of-Concept CNN in Halide
☆22 Updated 9 years ago
ctuning / ck-tensorrt
Collective Knowledge repository for NVIDIA's TensorRT
☆37 Updated 4 years ago
mlcommons / training_results_v0.6
This repository contains the results and code for the MLPerf™ Training v0.6 benchmark.
☆42 Updated 2 years ago
csehydrogen / Winograd-OpenCL
Winograd-based convolution implementation in OpenCL
☆28 Updated 8 years ago