Tiramisu-Compiler / tiramisu_pytorchLinks
Integration of Tiramisu (Compiler) into PyTorch
☆25Updated 5 years ago
Alternatives and similar repositories for tiramisu_pytorch
Users that are interested in tiramisu_pytorch are comparing it to the libraries listed below
Sorting:
- A Data-Centric Compiler for Machine Learning☆84Updated last year
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆27Updated 5 years ago
- GEMM and Winograd based convolutions using CUTLASS☆27Updated 5 years ago
- ParaDnn: A systematic performance analysis methodology for deep learning.☆39Updated 5 years ago
- Test suite for probing the numerical behavior of NVIDIA tensor cores☆41Updated last year
- MLIR-based partitioning system☆132Updated this week
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆138Updated 2 years ago
- This is the implementation for paper: AdaTune: Adaptive Tensor Program CompilationMade Efficient (NeurIPS 2020).☆14Updated 4 years ago
- Generator for MLIR files from known front-ends☆16Updated last year
- CUDA templates for tile-sparse matrix multiplication based on CUTLASS.☆51Updated 7 years ago
- A self-contained version of the tutorial which can be easily cloned and viewed by others.☆24Updated 6 years ago
- ☆12Updated 4 years ago
- A lightweight, Pythonic, frontend for MLIR☆80Updated last year
- Poplar libraries☆121Updated last year
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆64Updated 7 years ago
- Memory Optimizations for Deep Learning (ICML 2023)☆107Updated last year
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆45Updated last month
- Mille Crepe Bench: layer-wise performance analysis for deep learning frameworks.☆17Updated 5 years ago
- Sparsity support for PyTorch☆37Updated 6 months ago
- ☆16Updated 4 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memory☆134Updated 3 years ago
- ColTraIn HBFP Training Emulator☆16Updated 2 years ago
- A Deep Learning Meta-Framework and HPC Benchmarking Library☆81Updated 3 years ago
- The quantitative performance comparison among DL compilers on CNN models.☆74Updated 5 years ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com☆38Updated last year
- parser script to process pytorch autograd profiler result, convert json file to excel.☆15Updated 5 years ago
- Conversions to MLIR EmitC☆133Updated 9 months ago
- ☆23Updated last month
- System for automated integration of deep learning backends.☆47Updated 3 years ago
- ☆50Updated last year