brightlaboratory / polydlLinks
☆12Updated 3 years ago
Alternatives and similar repositories for polydl
Users that are interested in polydl are comparing it to the libraries listed below
Sorting:
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Updated 6 years ago
- ☆21Updated 4 months ago
- Code base for OOPSLA'24 paper: UniSparse: An Intermediate Language for General Sparse Format Customization☆30Updated 7 months ago
- ☆14Updated 3 years ago
- Mille Crepe Bench: layer-wise performance analysis for deep learning frameworks.☆17Updated 5 years ago
- ☆27Updated last year
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM.☆23Updated 6 months ago
- ☆34Updated 2 years ago
- GPTPU for SC 2021☆52Updated 2 years ago
- ColTraIn HBFP Training Emulator☆16Updated 2 years ago
- ☆12Updated last year
- A novel spatial accelerator for horizontal diffusion weather stencil computation, as described in ICS 2023 paper by Singh et al. (https:/…☆19Updated last year
- ☆18Updated 5 years ago
- Artifacts of EVT ASPLOS'24☆26Updated last year
- ☆22Updated 2 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…☆16Updated 3 years ago
- A Data-Centric Compiler for Machine Learning☆84Updated last year
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆18Updated 2 years ago
- Memory Optimizations for Deep Learning (ICML 2023)☆64Updated last year
- Cavs: An Efficient Runtime System for Dynamic Neural Networks☆14Updated 4 years ago
- Adaptive floating-point based numerical format for resilient deep learning☆14Updated 3 years ago
- A simulation framework for modeling efficiency of Graph Neural Network Dataflows☆22Updated 4 months ago
- An Attention Superoptimizer☆21Updated 5 months ago
- Sparsity support for PyTorch☆35Updated 3 months ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆25Updated 8 months ago
- ☆23Updated 7 months ago
- Test suite for probing the numerical behavior of NVIDIA tensor cores☆40Updated 11 months ago
- A curated list for Efficient Large Language Models☆11Updated last year
- GEMM and Winograd based convolutions using CUTLASS☆26Updated 4 years ago
- ☆18Updated 4 years ago