UIUC-ChenLab / scalehls
A scalable High-Level Synthesis framework on MLIR
☆217 Updated 4 months ago
Related projects:
- AutoSA: Polyhedral-Based Systolic Array Compiler ☆192 Updated last year
- HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing ☆322 Updated 4 months ago
- End-to-end SoC simulation: integrating the gem5 system simulator with the Aladdin accelerator simulator. ☆212 Updated last year
- RapidStream-TAPA compiles task-parallel HLS programs into high-frequency FPGA accelerators. ☆149 Updated this week
- OpenCGRA is an open-source framework for modeling, testing, and evaluating CGRAs. ☆131 Updated last year
- PyTorch model to RTL flow for low latency inference ☆118 Updated 6 months ago
- CGRA-Flow is an integrated framework for CGRA compilation, exploration, synthesis, and development. ☆102 Updated this week
- Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and … ☆217 Updated 2 months ago
- Release of stream-specialization software/hardware stack. ☆116 Updated last year
- Allo: A Programming Model for Composable Accelerator Design ☆122 Updated this week
- Repository to host and maintain scale-sim-v2 code ☆212 Updated last week
- Benchmarks for Accelerator Design and Customized Architectures ☆113 Updated 4 years ago
- CHARM: Composing Heterogeneous Accelerators on Versal ACAP Architecture ☆119 Updated last month
- ☆83 Updated 7 months ago
- STONNE: A Simulation Tool for Neural Networks Engines ☆115 Updated 3 months ago
- An integrated power, area, and timing modeling framework for multicore and manycore architectures ☆160 Updated 4 years ago
- ☆128 Updated 10 months ago
- RiVEC Benchmark Suite ☆88 Updated 3 weeks ago
- HW Architecture-Mapping Design Space Exploration Framework for Deep Learning Accelerators ☆99 Updated last week
- ☆76 Updated this week
- A matrix extension proposal for AI applications under RISC-V architecture ☆93 Updated 2 months ago
- ☆32 Updated 2 months ago
- Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures. ☆321 Updated last week
- Implementations of Buffets, which are efficient, composable idioms for implementing Explicit Decoupled Data Orchestration. ☆62 Updated 5 years ago
- This is the top-level repository for the Accel-Sim framework. ☆290 Updated 2 weeks ago
- An MLIR-based toolchain for AMD AI Engine-enabled devices. ☆283 Updated this week
- DRAMSys: a SystemC TLM-2.0 based DRAM simulator. ☆200 Updated last week
- RTL implementation of Flex-DPE. ☆84 Updated 4 years ago
- SystemC/C++ library of commonly-used hardware functions and components for HLS. ☆253 Updated last week
- ☆86 Updated 6 months ago