IHIaadj / HW-PR-NAS
HW-PR-NAS is a single surrogate model trained to Pareto rank the architectures based on Accuracy, Latency and energy consumption
☆11Updated last year
Related projects: ⓘ
- Hybrid Tiny Hardware-aware Neural Architecture Search☆15Updated 2 years ago
- ☆20Updated 2 years ago
- Analog AI Neural Architecture Search (analog-nas) is a modular and flexible framework to facilitate implementation of Analog-aware Neural…☆41Updated 4 months ago
- Integration of Tiramisu (Compiler) into PyTorch☆26Updated 4 years ago
- μNAS is a neural architecture search (NAS) system that designs small-yet-powerful microcontroller-compatible neural networks.☆75Updated 3 years ago
- ☆66Updated this week
- ☆15Updated 3 months ago
- ☆23Updated last year
- Reproducing Quantization paper PACT☆55Updated 2 years ago
- Binarize convolutional neural networks using pytorch☆131Updated 2 years ago
- [ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark☆101Updated last year
- [ICCAD'22 TinyML Contest] Efficient Heart Stroke Detection on Low-cost Microcontrollers☆15Updated last year
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆121Updated this week
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)☆36Updated 3 years ago
- ☆23Updated 2 years ago
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)☆26Updated 11 months ago
- Lightweight Neural Architecture Search for Temporal Convolutional Networks at the Edge☆9Updated last year
- Low Precision(quantized) Yolov5☆30Updated 7 months ago
- Static Block Floating Point Quantization for CNN☆32Updated 3 years ago
- Layer-wise Pruning of Transformer Heads for Efficient Language Modeling☆19Updated 2 years ago
- ☆67Updated 2 years ago
- A Plug-and-play Lightweight tool for the Inference Optimization of Deep Neural networks☆31Updated this week
- CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices☆39Updated 4 years ago
- Position-based Scaled Gradient for Model Quantization and Pruning Code (NeurIPS 2020)☆26Updated 3 years ago
- ☆47Updated 2 years ago
- ☆47Updated 4 years ago
- Torch2Chip (MLSys, 2024)☆49Updated 3 weeks ago
- This repository containts the pytorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr…☆47Updated 4 months ago
- ☆113Updated last year
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization☆96Updated 2 years ago