Tiramisu-Compiler / tiramisu_pytorch
Integration of Tiramisu (Compiler) into PyTorch
☆25 Updated 5 years ago
Alternatives and similar repositories for tiramisu_pytorch
Users who are interested in tiramisu_pytorch are comparing it to the libraries listed below.
- A self-contained version of the tutorial which can be easily cloned and viewed by others. ☆24 Updated 6 years ago
- ☆20 Updated 6 years ago
- Test suite for probing the numerical behavior of NVIDIA tensor cores ☆41 Updated last year
- CUDA Matrix Multiplication Optimization ☆247 Updated last year
- A Data-Centric Compiler for Machine Learning ☆85 Updated 2 weeks ago
- Poplar libraries ☆121 Updated 2 years ago
- Reference implementations of popular Binarized Neural Networks ☆109 Updated 3 weeks ago
- GEMM and Winograd based convolutions using CUTLASS ☆28 Updated 5 years ago
- ParaDnn: A systematic performance analysis methodology for deep learning. ☆40 Updated 5 years ago
- Issues related to MLPerf™ training policies, including rules and suggested changes ☆95 Updated 2 weeks ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning ☆141 Updated 2 years ago
- ☆186 Updated last year
- ☆23 Updated 4 months ago
- A polyhedral compiler for expressing fast and portable data parallel algorithms ☆953 Updated last year
- Training neural networks in TensorFlow 2.0 with 5x less memory ☆137 Updated 3 years ago
- The quantitative performance comparison among DL compilers on CNN models. ☆74 Updated 5 years ago
- System for automated integration of deep learning backends. ☆47 Updated 3 years ago
- A library of GPU kernels for sparse matrix operations. ☆280 Updated 5 years ago
- ☆110 Updated last year
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together ☆64 Updated 7 years ago
- ☆17 Updated 3 years ago
- This repository contains companion software for the Colfax Research paper "Categorical Foundations for CuTe Layouts". ☆83 Updated 3 months ago
- Memory Optimizations for Deep Learning (ICML 2023) ☆114 Updated last year
- A schedule language for large model training ☆152 Updated 4 months ago
- MLIR-based partitioning system ☆153 Updated last week
- TensorFlow for the IPU ☆79 Updated 2 months ago
- parser script to process pytorch autograd profiler result, convert json file to excel. ☆14 Updated 6 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation ☆27 Updated 6 years ago
- A lightweight, Pythonic, frontend for MLIR ☆80 Updated 2 years ago
- Sparsity support for PyTorch ☆37 Updated 9 months ago