Test winograd convolution written in TVM for CUDA and AMDGPU
☆41Oct 12, 2018Updated 7 years ago
Alternatives and similar repositories for tvm-winograd
Users that are interested in tvm-winograd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- compile yolov3 in TVM☆13Aug 14, 2023Updated 2 years ago
- GPU implementation of Winograd convolution☆10Oct 23, 2017Updated 8 years ago
- a model zoo☆11Jul 19, 2017Updated 8 years ago
- Implements an infinite sum of poisson-weighted convolutions☆27Aug 22, 2018Updated 7 years ago
- An Example of MXNet Models Comilation and Deployment with NNVM in C++☆16Apr 25, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Train Neuronal networks to automate your home☆20Mar 1, 2023Updated 3 years ago
- Implementation of Neural Arithmetic Logic Units (https://arxiv.org/pdf/1808.00508.pdf)☆31Oct 24, 2018Updated 7 years ago
- ☆10Sep 2, 2023Updated 2 years ago
- Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM☆49Nov 21, 2017Updated 8 years ago
- ☆25Dec 12, 2017Updated 8 years ago
- Fast CUDA Kernels for ResNet Inference.☆183May 26, 2019Updated 7 years ago
- study of Ampere' Sparse Matmul☆18Jan 10, 2021Updated 5 years ago
- Diagonalwise Refactorization: An Efficient Training Method for Depthwise Convolutions (in Caffe)☆34Dec 29, 2017Updated 8 years ago
- ☆13Apr 10, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementation of Adversarial Variational Optimization in PyTorch☆42Aug 7, 2018Updated 7 years ago
- nnvm&tvm example of cross compilation and deployment in Nvidia Jetson TX2 platform☆11Apr 17, 2018Updated 8 years ago
- Static analysis framework for analyzing programs written in TVM's Relay IR.☆29Oct 31, 2019Updated 6 years ago
- flexible-gemm conv of deepcore☆17Dec 2, 2019Updated 6 years ago
- auto-tuning momentum SGD optimizer☆23Jul 14, 2017Updated 8 years ago
- Caffe implementation of Optimal-Ternary-Weights-Approximation in "Two-Step Quantization for Low-bit Neural Networks" (CVPR2018).☆15Sep 21, 2018Updated 7 years ago
- A prototype implementation of AllReduce collective communication routine.☆19Sep 27, 2018Updated 7 years ago
- MXNet Model Serving☆25Oct 4, 2017Updated 8 years ago
- a mxnet multi-task tutorial☆33May 16, 2016Updated 10 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Just-in-time Dynamic Batching with MXNet Gluon.☆52May 18, 2020Updated 6 years ago
- repo for tvm☆26Updated this week
- ☆24Mar 22, 2018Updated 8 years ago
- ☆68Mar 4, 2023Updated 3 years ago
- An exploration of log domain "alternative floating point" for hardware ML/AI accelerators.☆400Mar 11, 2023Updated 3 years ago
- ☆16Nov 21, 2017Updated 8 years ago
- Add-on package for ONNX format support in Chainer☆86Nov 6, 2019Updated 6 years ago
- ☆13Nov 25, 2019Updated 6 years ago
- (Python3- TensorFlow 1.5) Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"☆13Mar 23, 2018Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A MXNet tiny face detector☆95Sep 7, 2018Updated 7 years ago
- mxnet deploy version of pseudo-3d-residual-networks(P-3D), sport1m and Kinetics pretrained model is supported☆13Jul 27, 2018Updated 7 years ago
- examples for tvm schedule API☆101Jun 12, 2023Updated 3 years ago
- A simplify version of mobilenet, with less group and feature maps, trained on Imagenet.☆18Jul 17, 2017Updated 8 years ago
- An experimental ahead of time compiler for Relay.☆49Apr 21, 2020Updated 6 years ago
- Implement asm gemm on vega64 for 4096x4096 fp32 matrix☆22Oct 12, 2019Updated 6 years ago
- Caffe Computation Graph Optimization.☆29Jan 7, 2020Updated 6 years ago