The quantitative performance comparison among DL compilers on CNN models.
☆73Aug 27, 2020Updated 5 years ago
Alternatives and similar repositories for dlcompiler-comparison
Users that are interested in dlcompiler-comparison are comparing it to the libraries listed below
Sorting:
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆199Apr 27, 2022Updated 3 years ago
- examples for tvm schedule API☆101Jun 12, 2023Updated 2 years ago
- 记录阅读各类paper的想法笔记(关注体系结构,机器学习系统,深度学习,计算机视觉)☆25Oct 25, 2019Updated 6 years ago
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"☆25Feb 24, 2023Updated 3 years ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆143Mar 31, 2023Updated 2 years ago
- Neural Network Acceleration such as ASIC, FPGA, GPU, and PIM☆54Apr 13, 2020Updated 5 years ago
- A list of awesome compiler projects and papers for tensor computation and deep learning.☆2,733Oct 19, 2024Updated last year
- An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).☆696Updated this week
- ☆23Dec 8, 2022Updated 3 years ago
- Automated DNN generation for fuzz testing and more☆143Jan 14, 2025Updated last year
- Dive into Deep Learning Compiler☆646Jun 19, 2022Updated 3 years ago
- The malsource dataset☆12Aug 31, 2021Updated 4 years ago
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆1,005Sep 19, 2024Updated last year
- Social Disatancing Monitor using yolov3 and DPU HW acceleration for Xilinx adaptive computing challenge 2020☆12Feb 17, 2023Updated 3 years ago
- Accelerate convolution neural network for face recognition using GPU☆13Nov 24, 2020Updated 5 years ago
- Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisati…☆1,657Jan 21, 2026Updated last month
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure☆977Feb 24, 2026Updated last week
- Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.☆14Dec 8, 2017Updated 8 years ago
- TVM learning and research☆13Jan 8, 2021Updated 5 years ago
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆917Dec 30, 2024Updated last year
- TVMFuzz: fuzzing tensor-level intermediate representation in TVM☆30May 24, 2020Updated 5 years ago
- CNN Accelerator in Frequency Domain☆12Feb 22, 2020Updated 6 years ago
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections☆125Jun 23, 2022Updated 3 years ago
- ☆95Nov 4, 2022Updated 3 years ago
- ☆14Mar 10, 2024Updated last year
- A quick tour to *Data types à la carte* for reading group presentation.☆16Feb 7, 2023Updated 3 years ago
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 6 months ago
- [NeurIPS 2021] "Drawing Robust Scratch Tickets: Subnetworks with Inborn Robustness Are Found within Randomly Initialized Networks" by Yon…☆13Feb 13, 2022Updated 4 years ago
- [ICML 2021] "Double-Win Quant: Aggressively Winning Robustness of Quantized DeepNeural Networks via Random Precision Training and Inferen…☆16Feb 13, 2022Updated 4 years ago
- The Tensor Algebra SuperOptimizer for Deep Learning☆739Jan 26, 2023Updated 3 years ago
- Re-implementation of the TASO compiler using equality saturation☆138Jun 28, 2021Updated 4 years ago
- ☆34Jun 7, 2021Updated 4 years ago
- TinyVers Heterogeneous SoC consists of a reconfigurable FlexML accelerator, a RISC-V processor, an eMRAM and a power management system.☆23Jul 12, 2023Updated 2 years ago
- This is a clone of an SVN repository at svn://vcs.exim.org/pcre/code/trunk. It had been cloned by http://svn2github.com/ , but the servic…☆13Jan 4, 2019Updated 7 years ago
- ☆112Apr 19, 2024Updated last year
- The code for our paper "Neural Architecture Search as Program Transformation Exploration"☆16Apr 28, 2021Updated 4 years ago
- Latte is a convolutional neural network (CNN) inference engine written in C++ and uses AVX to vectorize operations. The engine runs on Wi…☆13Jun 25, 2018Updated 7 years ago
- Distributed Algorithms — Online Textbook☆18Jan 2, 2021Updated 5 years ago
- ☆14Sep 27, 2021Updated 4 years ago