Test winograd convolution written in TVM for CUDA and AMDGPU
☆41Oct 12, 2018Updated 7 years ago
Alternatives and similar repositories for tvm-winograd
Users that are interested in tvm-winograd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- compile yolov3 in TVM☆13Aug 14, 2023Updated 2 years ago
- GPU implementation of Winograd convolution☆10Oct 23, 2017Updated 8 years ago
- Benchmark of TVM quantized model on CUDA☆112Jun 19, 2020Updated 5 years ago
- a model zoo☆11Jul 19, 2017Updated 8 years ago
- An Example of MXNet Models Comilation and Deployment with NNVM in C++☆16Apr 25, 2018Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Train Neuronal networks to automate your home☆19Mar 1, 2023Updated 3 years ago
- TVM learning and research☆13Jan 8, 2021Updated 5 years ago
- PyTorch -> ONNX -> TVM for autotuning☆24Feb 28, 2020Updated 6 years ago
- ☆10Sep 2, 2023Updated 2 years ago
- Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM☆49Nov 21, 2017Updated 8 years ago
- Fast CUDA Kernels for ResNet Inference.☆183May 26, 2019Updated 6 years ago
- study of Ampere' Sparse Matmul☆18Jan 10, 2021Updated 5 years ago
- Diagonalwise Refactorization: An Efficient Training Method for Depthwise Convolutions (in Caffe)☆34Dec 29, 2017Updated 8 years ago
- ☆41Mar 31, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆13Apr 10, 2017Updated 9 years ago
- Implementation of Adversarial Variational Optimization in PyTorch☆42Aug 7, 2018Updated 7 years ago
- Static analysis framework for analyzing programs written in TVM's Relay IR.☆29Oct 31, 2019Updated 6 years ago
- flexible-gemm conv of deepcore☆17Dec 2, 2019Updated 6 years ago
- auto-tuning momentum SGD optimizer☆23Jul 14, 2017Updated 8 years ago
- A prototype implementation of AllReduce collective communication routine.☆19Sep 27, 2018Updated 7 years ago
- MXNet Model Serving☆25Oct 4, 2017Updated 8 years ago
- a mxnet multi-task tutorial☆33May 16, 2016Updated 9 years ago
- Just-in-time Dynamic Batching with MXNet Gluon.☆52May 18, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- demo about the usage of tvm.☆12Jan 31, 2019Updated 7 years ago
- repo for tvm☆26Updated this week
- ☆24Mar 22, 2018Updated 8 years ago
- An exploration of log domain "alternative floating point" for hardware ML/AI accelerators.☆400Mar 11, 2023Updated 3 years ago
- ☆68Mar 4, 2023Updated 3 years ago
- ☆16Nov 21, 2017Updated 8 years ago
- Add-on package for ONNX format support in Chainer☆86Nov 6, 2019Updated 6 years ago
- Densely Connected Convolutional Network implementation by Chainer☆39Jul 15, 2017Updated 8 years ago
- ☆13Nov 25, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A MXNet tiny face detector☆95Sep 7, 2018Updated 7 years ago
- mxnet deploy version of pseudo-3d-residual-networks(P-3D), sport1m and Kinetics pretrained model is supported☆13Jul 27, 2018Updated 7 years ago
- An experimental ahead of time compiler for Relay.☆49Apr 21, 2020Updated 6 years ago
- Implement asm gemm on vega64 for 4096x4096 fp32 matrix☆22Oct 12, 2019Updated 6 years ago
- Caffe Computation Graph Optimization.☆29Jan 7, 2020Updated 6 years ago
- Kaggle Avito Demand Challenge (top 1% solution)☆17Jul 31, 2018Updated 7 years ago
- ☆27Apr 18, 2019Updated 7 years ago