brightlaboratory / polydl
☆12 Updated 3 years ago
Alternatives and similar repositories for polydl
Users who are interested in polydl are comparing it to the libraries listed below
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM. ☆23 Updated 5 months ago
- ☆14 Updated 3 years ago
- ☆21 Updated 3 months ago
- A simulation framework for modeling efficiency of Graph Neural Network Dataflows ☆22 Updated 3 months ago
- A novel spatial accelerator for horizontal diffusion weather stencil computation, as described in ICS 2023 paper by Singh et al. (https:/… ☆19 Updated last year
- Adaptive floating-point based numerical format for resilient deep learning ☆14 Updated 3 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches ☆15 Updated 5 years ago
- Test suite for probing the numerical behavior of NVIDIA tensor cores ☆38 Updated 9 months ago
- ☆18 Updated 5 years ago
- Mille Crepe Bench: layer-wise performance analysis for deep learning frameworks. ☆17 Updated 5 years ago
- ☆41 Updated this week
- Code base for OOPSLA'24 paper: UniSparse: An Intermediate Language for General Sparse Format Customization ☆30 Updated 6 months ago
- GPTPU for SC 2021 ☆51 Updated 2 years ago
- Memory Optimizations for Deep Learning (ICML 2023) ☆64 Updated last year
- ☆16 Updated last year
- ☆22 Updated 2 years ago
- Provides the code for the paper "EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerators" by Luk… ☆19 Updated 5 years ago
- Cavs: An Efficient Runtime System for Dynamic Neural Networks ☆14 Updated 4 years ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo ☆18 Updated 2 years ago
- GoldenEye is a functional simulator with fault injection capabilities for common and emerging numerical formats, implemented for the PyTo… ☆24 Updated 6 months ago
- This is the implementation for the paper: AdaTune: Adaptive Tensor Program Compilation Made Efficient (NeurIPS 2020). ☆13 Updated 4 years ago
- Heterogeneous Accelerated Computed Cluster (HACC) Resources Page ☆21 Updated this week
- ☆15 Updated 2 years ago
- HW/SW co-design of sentence-level energy optimizations for latency-aware multi-task NLP inference ☆48 Updated last year
- FRAME: Fast Roofline Analytical Modeling and Estimation ☆34 Updated last year
- AIM: Accelerating Arbitrary-precision Integer Multiplication on Heterogeneous Reconfigurable Computing Platform Versal ACAP (Full Paper a… ☆22 Updated this week
- Benchmark PyTorch Custom Operators ☆14 Updated last year
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation ☆27 Updated 5 years ago
- ☆37 Updated 2 years ago
- ☆13 Updated 3 years ago