Tiramisu-Compiler / tiramisu_pytorch
Integration of Tiramisu (Compiler) into PyTorch
☆25Updated 4 years ago
Alternatives and similar repositories for tiramisu_pytorch:
Users that are interested in tiramisu_pytorch are comparing it to the libraries listed below
- Hybrid Tiny Hardware-aware Neural Architecture Search☆15Updated 2 years ago
- A polyhedral compiler for expressing fast and portable data parallel algorithms☆933Updated 5 months ago
- A self-contained version of the tutorial which can be easily cloned and viewed by others.☆24Updated 5 years ago
- Test suite for probing the numerical behavior of NVIDIA tensor cores☆37Updated 9 months ago
- GEMM and Winograd based convolutions using CUTLASS☆26Updated 4 years ago
- Official pytorch code for "APP: Anytime Progressive Pruning" (DyNN @ ICML, 2022; CLL @ ACML, 2022, SNN @ ICML, 2022 and SlowDNN 2023)☆16Updated 2 years ago
- HW-PR-NAS is a single surrogate model trained to Pareto rank the architectures based on Accuracy, Latency and energy consumption☆13Updated 2 years ago
- CUDA templates for tile-sparse matrix multiplication based on CUTLASS.☆51Updated 7 years ago
- ☆23Updated 5 months ago
- Memory Optimizations for Deep Learning (ICML 2023)☆64Updated last year
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆40Updated last month
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆27Updated 5 years ago
- Fast sparse deep learning on CPUs☆53Updated 2 years ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆135Updated 2 years ago
- Benchmark PyTorch Custom Operators☆14Updated last year
- Benchmarks to capture important workloads.☆31Updated 2 months ago
- ☆12Updated 3 years ago
- Yaae: Yet another autodiff engine (written in Numpy).☆27Updated last year
- ☆18Updated 5 years ago
- A lightweight, Pythonic, frontend for MLIR☆81Updated last year
- Training material for IPU users: tutorials, feature examples, simple applications☆86Updated 2 years ago
- A sandbox for quick iteration and experimentation on projects related to IREE, MLIR, and LLVM☆57Updated last month
- A Deep Learning Meta-Framework and HPC Benchmarking Library☆81Updated 2 years ago
- Codebase associated with the PyTorch compiler tutorial☆45Updated 5 years ago
- MLIR-based partitioning system☆80Updated this week
- PyTorch interface for the IPU☆179Updated last year
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆79Updated last year
- System for automated integration of deep learning backends.☆47Updated 2 years ago
- A list of awesome neural symbolic papers.☆47Updated 2 years ago
- Mille Crepe Bench: layer-wise performance analysis for deep learning frameworks.☆17Updated 5 years ago