Tiramisu-Compiler / tiramisu_pytorch
Integration of Tiramisu (Compiler) into PyTorch
☆26 Updated 4 years ago
Related projects
Alternatives and complementary repositories for tiramisu_pytorch
- Hybrid Tiny Hardware-aware Neural Architecture Search ☆15 Updated 2 years ago
- A polyhedral compiler for expressing fast and portable data parallel algorithms ☆919 Updated last month
- HW-PR-NAS is a single surrogate model trained to Pareto rank the architectures based on Accuracy, Latency and energy consumption ☆12 Updated 2 years ago
- Fast sparse deep learning on CPUs ☆51 Updated 2 years ago
- Memory Optimizations for Deep Learning (ICML 2023) ☆60 Updated 8 months ago
- A self-contained version of the tutorial which can be easily cloned and viewed by others. ☆24 Updated 5 years ago
- GEMM and Winograd based convolutions using CUTLASS ☆25 Updated 4 years ago
- A Data-Centric Compiler for Machine Learning ☆82 Updated 10 months ago
- Benchmarks to capture important workloads. ☆28 Updated 5 months ago
- Generator for MLIR files from known front-ends ☆15 Updated last year
- ☆17 Updated 4 years ago
- ParaDnn: A systematic performance analysis methodology for deep learning. ☆39 Updated 4 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memory ☆129 Updated 2 years ago
- Test suite for probing the numerical behavior of NVIDIA tensor cores ☆30 Updated 3 months ago
- ColTraIn HBFP Training Emulator ☆16 Updated last year
- A Deep Learning Meta-Framework and HPC Benchmarking Library ☆81 Updated 2 years ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together ☆64 Updated 6 years ago
- Research and development for optimizing transformers ☆125 Updated 3 years ago
- An experimental CPU backend for Triton (https://github.com/openai/triton) ☆35 Updated 6 months ago
- The quantitative performance comparison among DL compilers on CNN models. ☆75 Updated 4 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation ☆26 Updated 5 years ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning ☆131 Updated last year
- ☆11 Updated 3 years ago
- ☆16 Updated 7 months ago
- GoldenEye is a functional simulator with fault injection capabilities for common and emerging numerical formats, implemented for the PyTo… ☆22 Updated last month
- Conversions to MLIR EmitC ☆124 Updated 3 months ago
- Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018 ☆71 Updated 4 years ago
- ☆11 Updated 2 years ago
- ☆31 Updated 2 years ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com ☆38 Updated 11 months ago