Test winograd convolution written in TVM for CUDA and AMDGPU
☆41Oct 12, 2018Updated 7 years ago
Alternatives and similar repositories for tvm-winograd
Users that are interested in tvm-winograd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- GPU implementation of Winograd convolution☆10Oct 23, 2017Updated 8 years ago
- Benchmark of TVM quantized model on CUDA☆112Jun 19, 2020Updated 5 years ago
- a model zoo☆11Jul 19, 2017Updated 8 years ago
- Implements an infinite sum of poisson-weighted convolutions☆27Aug 22, 2018Updated 7 years ago
- An Example of MXNet Models Comilation and Deployment with NNVM in C++☆16Apr 25, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Train Neuronal networks to automate your home☆19Mar 1, 2023Updated 3 years ago
- Implementation of Neural Arithmetic Logic Units (https://arxiv.org/pdf/1808.00508.pdf)☆31Oct 24, 2018Updated 7 years ago
- ☆10Sep 2, 2023Updated 2 years ago
- Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM☆49Nov 21, 2017Updated 8 years ago
- ☆25Dec 12, 2017Updated 8 years ago
- Fast CUDA Kernels for ResNet Inference.☆183May 26, 2019Updated 7 years ago
- study of Ampere' Sparse Matmul☆18Jan 10, 2021Updated 5 years ago
- Diagonalwise Refactorization: An Efficient Training Method for Depthwise Convolutions (in Caffe)☆34Dec 29, 2017Updated 8 years ago
- ☆41Mar 31, 2022Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆13Apr 10, 2017Updated 9 years ago
- Implementation of Adversarial Variational Optimization in PyTorch☆42Aug 7, 2018Updated 7 years ago
- nnvm&tvm example of cross compilation and deployment in Nvidia Jetson TX2 platform☆11Apr 17, 2018Updated 8 years ago
- Static analysis framework for analyzing programs written in TVM's Relay IR.☆29Oct 31, 2019Updated 6 years ago
- flexible-gemm conv of deepcore☆17Dec 2, 2019Updated 6 years ago
- Caffe implementation of Optimal-Ternary-Weights-Approximation in "Two-Step Quantization for Low-bit Neural Networks" (CVPR2018).☆15Sep 21, 2018Updated 7 years ago
- A prototype implementation of AllReduce collective communication routine.☆19Sep 27, 2018Updated 7 years ago
- a mxnet multi-task tutorial☆33May 16, 2016Updated 10 years ago
- Just-in-time Dynamic Batching with MXNet Gluon.☆52May 18, 2020Updated 6 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- demo about the usage of tvm.☆12Jan 31, 2019Updated 7 years ago
- ☆24Mar 22, 2018Updated 8 years ago
- An exploration of log domain "alternative floating point" for hardware ML/AI accelerators.☆400Mar 11, 2023Updated 3 years ago
- ☆16Nov 21, 2017Updated 8 years ago
- Add-on package for ONNX format support in Chainer☆86Nov 6, 2019Updated 6 years ago
- Densely Connected Convolutional Network implementation by Chainer☆39Jul 15, 2017Updated 8 years ago
- A MXNet tiny face detector☆95Sep 7, 2018Updated 7 years ago
- mxnet deploy version of pseudo-3d-residual-networks(P-3D), sport1m and Kinetics pretrained model is supported☆13Jul 27, 2018Updated 7 years ago
- examples for tvm schedule API☆101Jun 12, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A simplify version of mobilenet, with less group and feature maps, trained on Imagenet.☆18Jul 17, 2017Updated 8 years ago
- A collection of papers on reinforcement learning applied to NLP☆14Sep 7, 2018Updated 7 years ago
- An experimental ahead of time compiler for Relay.☆49Apr 21, 2020Updated 6 years ago
- Implement asm gemm on vega64 for 4096x4096 fp32 matrix☆22Oct 12, 2019Updated 6 years ago
- Caffe Computation Graph Optimization.☆29Jan 7, 2020Updated 6 years ago
- Kaggle Avito Demand Challenge (top 1% solution)☆17Jul 31, 2018Updated 7 years ago
- Using TVM to depoly Transformer on CPU and GPU☆11Aug 25, 2021Updated 4 years ago