jack-willturner / nas-as-program-transformation-exploration
The code for our paper "Neural Architecture Search as Program Transformation Exploration"
☆18Updated 3 years ago
Alternatives and similar repositories for nas-as-program-transformation-exploration:
Users that are interested in nas-as-program-transformation-exploration are comparing it to the libraries listed below
- This is the implementation for paper: AdaTune: Adaptive Tensor Program CompilationMade Efficient (NeurIPS 2020).☆13Updated 3 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆27Updated 5 years ago
- Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.☆24Updated 4 years ago
- An external memory allocator example for PyTorch.☆14Updated 3 years ago
- ☆13Updated 3 years ago
- Benchmark PyTorch Custom Operators☆14Updated last year
- An extention of TVMScript to write simple and high performance GPU kernels with tensorcore.☆50Updated 9 months ago
- ☆17Updated 3 years ago
- DietCode Code Release☆63Updated 2 years ago
- ☆14Updated 3 years ago
- ☆19Updated 6 months ago
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"☆25Updated 2 years ago
- Artifacts of EVT ASPLOS'24☆23Updated last year
- Training with Block Minifloat number representation☆14Updated 3 years ago
- System for automated integration of deep learning backends.☆47Updated 2 years ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆18Updated 2 years ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆135Updated 2 years ago
- GoldenEye is a functional simulator with fault injection capabilities for common and emerging numerical formats, implemented for the PyTo…☆24Updated 6 months ago
- Code for ICML 2021 submission☆34Updated 4 years ago
- ☆18Updated 4 years ago
- one-shot-tuner☆8Updated 2 years ago
- The quantitative performance comparison among DL compilers on CNN models.☆74Updated 4 years ago
- ColTraIn HBFP Training Emulator☆16Updated 2 years ago
- An Attention Superoptimizer☆21Updated 3 months ago
- ThrillerFlow is a Dataflow Analysis and Codegen Framework written in Rust.☆14Updated 5 months ago
- ☆21Updated 2 months ago
- HeteroHalide: From Image Processing DSL to Efficient FPGA Acceleration☆15Updated 4 years ago
- Benchmark scripts for TVM☆74Updated 3 years ago
- ☆18Updated 3 years ago
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS☆25Updated 2 months ago