Test winograd convolution written in TVM for CUDA and AMDGPU
☆41Oct 12, 2018Updated 7 years ago
Alternatives and similar repositories for tvm-winograd
Users that are interested in tvm-winograd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- GPU implementation of Winograd convolution☆10Oct 23, 2017Updated 8 years ago
- Benchmark of TVM quantized model on CUDA☆112Jun 19, 2020Updated 5 years ago
- a model zoo☆11Jul 19, 2017Updated 8 years ago
- Implements an infinite sum of poisson-weighted convolutions☆27Aug 22, 2018Updated 7 years ago
- An Example of MXNet Models Comilation and Deployment with NNVM in C++☆16Apr 25, 2018Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Train Neuronal networks to automate your home☆19Mar 1, 2023Updated 3 years ago
- Implementation of Neural Arithmetic Logic Units (https://arxiv.org/pdf/1808.00508.pdf)☆31Oct 24, 2018Updated 7 years ago
- TVM learning and research☆13Jan 8, 2021Updated 5 years ago
- PyTorch -> ONNX -> TVM for autotuning☆24Feb 28, 2020Updated 6 years ago
- ☆10Sep 2, 2023Updated 2 years ago
- Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM☆49Nov 21, 2017Updated 8 years ago
- ☆25Dec 12, 2017Updated 8 years ago
- Fast CUDA Kernels for ResNet Inference.☆183May 26, 2019Updated 6 years ago
- study of Ampere' Sparse Matmul☆18Jan 10, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Diagonalwise Refactorization: An Efficient Training Method for Depthwise Convolutions (in Caffe)☆34Dec 29, 2017Updated 8 years ago
- ☆13Apr 10, 2017Updated 9 years ago
- nnvm&tvm example of cross compilation and deployment in Nvidia Jetson TX2 platform☆11Apr 17, 2018Updated 8 years ago
- Static analysis framework for analyzing programs written in TVM's Relay IR.☆29Oct 31, 2019Updated 6 years ago
- flexible-gemm conv of deepcore☆17Dec 2, 2019Updated 6 years ago
- auto-tuning momentum SGD optimizer☆23Jul 14, 2017Updated 8 years ago
- Caffe implementation of Optimal-Ternary-Weights-Approximation in "Two-Step Quantization for Low-bit Neural Networks" (CVPR2018).☆15Sep 21, 2018Updated 7 years ago
- A prototype implementation of AllReduce collective communication routine.☆19Sep 27, 2018Updated 7 years ago
- MXNet Model Serving☆25Oct 4, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- a mxnet multi-task tutorial☆33May 16, 2016Updated 9 years ago
- Just-in-time Dynamic Batching with MXNet Gluon.☆52May 18, 2020Updated 5 years ago
- demo about the usage of tvm.☆12Jan 31, 2019Updated 7 years ago
- repo for tvm☆26Updated this week
- ☆24Mar 22, 2018Updated 8 years ago
- ☆68Mar 4, 2023Updated 3 years ago
- ☆16Nov 21, 2017Updated 8 years ago
- An exploration of log domain "alternative floating point" for hardware ML/AI accelerators.☆400Mar 11, 2023Updated 3 years ago
- Add-on package for ONNX format support in Chainer☆86Nov 6, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- List of papers that applied graph network to NLP☆13Feb 26, 2019Updated 7 years ago
- Densely Connected Convolutional Network implementation by Chainer☆39Jul 15, 2017Updated 8 years ago
- ☆13Nov 25, 2019Updated 6 years ago
- (Python3- TensorFlow 1.5) Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"☆13Mar 23, 2018Updated 8 years ago
- A MXNet tiny face detector☆95Sep 7, 2018Updated 7 years ago
- mxnet deploy version of pseudo-3d-residual-networks(P-3D), sport1m and Kinetics pretrained model is supported☆13Jul 27, 2018Updated 7 years ago
- examples for tvm schedule API☆101Jun 12, 2023Updated 2 years ago