ucb-bar / Baremetal-NNLinks
Tool for converting PyTorch models into raw C codes with minimal dependency and some performance optimizations.
☆44Updated 2 months ago
Alternatives and similar repositories for Baremetal-NN
Users that are interested in Baremetal-NN are comparing it to the libraries listed below
Sorting:
- Chisel RISC-V Vector 1.0 Implementation☆121Updated last month
- muRISCV-NN is a collection of efficient deep learning kernels for embedded platforms and microcontrollers.☆89Updated last month
- DNN Compiler for Heterogeneous SoCs☆54Updated last week
- Spatz is a compact RISC-V-based vector processor meant for high-performance, small computing clusters.☆124Updated last week
- Heterogeneous Research Platform (HERO) for exploration of heterogeneous computers consisting of programmable many-core accelerators and a…☆113Updated 2 years ago
- DHLS (Dynamic High-Level Synthesis) compiler based on MLIR☆146Updated last week
- Vector processor for RISC-V vector ISA☆130Updated 5 years ago
- An Open-Hardware CGRA for accelerated computation on the edge.☆38Updated last month
- ☆88Updated this week
- Unit tests generator for RVV 1.0☆95Updated 2 weeks ago
- A matrix extension proposal for AI applications under RISC-V architecture☆155Updated 9 months ago
- FlexGripPlus: an open-source GPU model for reliability evaluation and micro architectural simulation☆112Updated 2 years ago
- An LLVM pass that can generate CDFG and map the target loops onto a parameterizable CGRA.☆79Updated last month
- Tests for example Rocket Custom Coprocessors☆75Updated 5 years ago
- ☆35Updated last week
- Transactional Verilog design and Verilator Testbench for a RISC-V TensorCore Vector co-processor for reproducible linear algebra☆59Updated 3 years ago
- ☆57Updated 7 months ago
- An energy-efficient RISC-V floating-point compute cluster.☆114Updated this week
- Vector Acceleration IP core for RISC-V*☆189Updated 6 months ago
- matrix-coprocessor for RISC-V☆25Updated 7 months ago
- An MLIR dialect to enable the efficient acceleration of ML model on CGRAs.☆64Updated last year
- ☆64Updated 7 months ago
- A Chisel RTL generator for network-on-chip interconnects☆222Updated 3 weeks ago
- Microarchitecture implementation of the decoupled vector-fetch accelerator☆157Updated last year
- A 32-bit RISC-V Processor Designed with High-Level Synthesis☆54Updated 5 years ago
- CGRA-Flow is an integrated framework for CGRA compilation, exploration, synthesis, and development.☆144Updated last week
- Fork of seldridge/rocket-rocc-examples with tests for a systolic array based matmul accelerator☆60Updated 5 months ago
- Administrative repository for the Integrated Matrix Extension Task Group☆30Updated last month
- A DSL for Systolic Arrays☆82Updated 6 years ago
- PiDRAM is the first flexible end-to-end framework that enables system integration studies and evaluation of real Processing-using-Memory …☆71Updated last year