enyac-group / NeuralPowerLinks
The code for paper: Neuralpower: Predict and deploy energy-efficient convolutional neural networks
☆21Updated 6 years ago
Alternatives and similar repositories for NeuralPower
Users that are interested in NeuralPower are comparing it to the libraries listed below
Sorting:
- Simulator for BitFusion☆102Updated 5 years ago
- ☆14Updated 4 years ago
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)☆27Updated 2 years ago
- Tool for optimize CNN blocking☆93Updated 5 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆27Updated 5 years ago
- Linux docker for the DNN accelerator exploration infrastructure composed of Accelergy and Timeloop☆58Updated 2 weeks ago
- pytorch fixed point training tool/framework☆34Updated 5 years ago
- This is the implementation for paper: AdaTune: Adaptive Tensor Program CompilationMade Efficient (NeurIPS 2020).☆14Updated 4 years ago
- ☆36Updated 6 years ago
- ☆71Updated 5 years ago
- ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference☆157Updated 8 months ago
- Explore the energy-efficient dataflow scheduling for neural networks.☆228Updated 5 years ago
- DAC System Design Contest 2020☆29Updated 5 years ago
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)☆41Updated 4 years ago
- DNN quantization with outlier channel splitting (ICML'19)☆113Updated 5 years ago
- ☆32Updated 4 years ago
- Implementation of "NITI: Training Integer Neural Networks Using Integer-only Arithmetic" on arxiv☆86Updated 3 years ago
- MICRO22 artifact evaluation for Sparseloop☆44Updated 3 years ago
- agile hardware-software co-design☆52Updated 3 years ago
- HW/SW co-design of sentence-level energy optimizations for latency-aware multi-task NLP inference☆52Updated last year
- Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.☆23Updated 5 years ago
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.☆111Updated 10 months ago
- Automatic Schedule Exploration and Optimization Framework for Tensor Computations☆180Updated 3 years ago
- ☆29Updated 6 years ago
- ☆29Updated 3 years ago
- ☆111Updated last year
- Approximate layers - TensorFlow extension☆26Updated 6 months ago
- ☆35Updated 5 years ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆115Updated 3 years ago
- A reference implementation of the Mind Mappings Framework.☆30Updated 3 years ago