DLPack for TensorFlow
☆35 Apr 13, 2020 Updated 5 years ago
Alternatives and similar repositories for tf-dlpack
Users who are interested in tf-dlpack are comparing it to the libraries listed below.
- TensorFlow and TVM integration ☆36 Apr 27, 2020 Updated 5 years ago
- Just-in-time Dynamic Batching with MXNet Gluon. ☆52 May 18, 2020 Updated 5 years ago
- An implementation of the ICCP '16 paper "Blind Dehazing Using Internal Patch Recurrence". ☆12 Aug 14, 2018 Updated 7 years ago
- An experimental ahead of time compiler for Relay. ☆49 Apr 21, 2020 Updated 5 years ago
- ☆16 Nov 21, 2017 Updated 8 years ago
- A basic Docker-based installation of TVM ☆11 Jun 23, 2022 Updated 3 years ago
- Visualize TVM Relay program graph ☆12 Nov 19, 2019 Updated 6 years ago
- A home for the final text of all TVM RFCs. ☆109 Sep 24, 2024 Updated last year
- Simple Training and Deployment of Fast End-to-End Binary Networks ☆160 Feb 1, 2022 Updated 4 years ago
- Mille Crepe Bench: layer-wise performance analysis for deep learning frameworks. ☆18 Oct 22, 2019 Updated 6 years ago
- The Tensor Algebra SuperOptimizer for Deep Learning ☆739 Jan 26, 2023 Updated 3 years ago
- ☆42 Sep 8, 2023 Updated 2 years ago
- Tutorial to optimize GEMM performance on Android ☆51 Feb 17, 2016 Updated 10 years ago
- ☆22 Mar 27, 2022 Updated 3 years ago
- Integration of Tiramisu (Compiler) into PyTorch ☆25 May 27, 2020 Updated 5 years ago
- ☆24 Feb 20, 2024 Updated 2 years ago
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description. ☆1,006 Sep 19, 2024 Updated last year
- a Deep Residual Network Example for MXNet on cifar10 dataset ☆20 Jan 27, 2016 Updated 10 years ago
- this repo attempts to reproduce Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks (CycleGAN) use gluon reimpl… ☆32 Aug 19, 2018 Updated 7 years ago
- An IR for efficiently simulating distributed ML computation. ☆32 Jan 13, 2024 Updated 2 years ago
- auto-tuning momentum SGD optimizer ☆23 Jul 14, 2017 Updated 8 years ago
- TVMScript kernel for deformable attention ☆25 Dec 15, 2021 Updated 4 years ago
- PyTorch-Based Fast and Efficient Processing for Various Machine Learning Applications with Diverse Sparsity ☆120 Dec 22, 2025 Updated 2 months ago
- TVMFuzz: fuzzing tensor-level intermediate representation in TVM ☆30 May 24, 2020 Updated 5 years ago
- Languages, Tools, and Techniques for Accelerator Design ☆33 Nov 2, 2021 Updated 4 years ago
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems ☆137 May 19, 2020 Updated 5 years ago
- ☆68 Mar 4, 2023 Updated 2 years ago
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud. ☆122 Nov 15, 2023 Updated 2 years ago
- Kernel Fusion and Runtime Compilation Based on NNVM ☆73 Nov 21, 2016 Updated 9 years ago
- Re-implementation of the TASO compiler using equality saturation ☆138 Jun 28, 2021 Updated 4 years ago
- TensorFlow Serving benchmark ☆33 Feb 12, 2018 Updated 8 years ago
- ☆84 Feb 5, 2026 Updated 3 weeks ago
- Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616 ☆133 Jul 6, 2023 Updated 2 years ago
- common in-memory tensor structure ☆1,171 Jan 26, 2026 Updated last month
- An implementation of the NAACL'18 paper "Punny Captions: Witty Wordplay in Image Descriptions". ☆33 Jun 27, 2018 Updated 7 years ago
- A schedule language for large model training ☆152 Aug 21, 2025 Updated 6 months ago
- Documentation for StreamExecutor open source proposal ☆83 Mar 28, 2016 Updated 9 years ago
- Race Condition Running ☆11 Feb 22, 2026 Updated last week
- C++ Hough Forests with OpenCV ☆11 Jul 28, 2016 Updated 9 years ago