KULeuven-MICAS / htvm
Efficient Neural Network Deployment on Heterogenous TinyML Platforms
☆14Updated last year
Alternatives and similar repositories for htvm:
Users that are interested in htvm are comparing it to the libraries listed below
- DNN Compiler for Heterogeneous SoCs☆22Updated this week
- Multi-core HW accelerator mapping optimization framework for layer-fused ML workloads.☆47Updated this week
- HLSFactory: A Framework Empowering High-Level Synthesis Datasets for Machine Learning and Beyond☆28Updated 2 months ago
- An Open Workflow to Build Custom SoCs and run Deep Models at the Edge☆70Updated 2 months ago
- Performance and resource models for fpgaConvNet: a Streaming-Architecture-based CNN Accelerator.☆28Updated 2 months ago
- dMazeRunner: Dataflow acceleration optimization infrastructure for coarse-grained programmable accelerators☆45Updated 2 years ago
- HW Architecture-Mapping Design Space Exploration Framework for Deep Learning Accelerators☆124Updated this week
- CGRA framework with vectorization support.☆21Updated this week
- SAMO: Streaming Architecture Mapping Optimisation☆32Updated last year
- An MLIR dialect to enable the efficient acceleration of ML model on CGRAs.☆55Updated 3 months ago
- Provides the hardware code for the paper "EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerator…☆24Updated 4 years ago
- ☆10Updated 2 months ago
- ☆71Updated last year
- [TECS'23] A project on the co-design of Accelerators and CNNs.☆20Updated 2 years ago
- Benchmark framework of 3D integrated CIM accelerators for popular DNN inference, support both monolithic and heterogeneous 3D integration☆21Updated 3 years ago
- PolyLUT is the first quantized neural network training methodology that maps a neuron to a LUT while using multivariate polynomial functi…☆45Updated 11 months ago
- A collection of tutorials for the fpgaConvNet framework.☆38Updated 4 months ago
- Code for paper "FuSeConv Fully Separable Convolutions for Fast Inference on Systolic Arrays" published at DATE 2021☆14Updated 3 years ago
- An FPGA accelerator for general-purpose Sparse-Matrix Dense-Matrix Multiplication (SpMM).☆77Updated 6 months ago
- ☆56Updated 4 years ago
- ☆83Updated 7 months ago
- ☆3Updated 3 years ago
- FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks☆42Updated 2 years ago
- ☆53Updated last year
- Low level design of a chip built for optimizing/accelerating CNN classifiers over gray scale images.☆12Updated 5 years ago
- RTL implementation of Flex-DPE.☆97Updated 4 years ago
- An LLVM pass that can generate CDFG and map the target loops onto a parameterizable CGRA.☆59Updated 2 months ago
- Sparse CNN Accelerator targeting Intel FPGA☆11Updated 3 years ago
- ACM TODAES Best Paper Award, 2022☆24Updated last year
- ☆33Updated this week