fpgasystems / FPGA-Recommendation-Accelerator
MLSys 2021 paper: MicroRec: efficient recommendation inference by hardware and data structure solutions
☆15 Updated 3 years ago
Related projects
Alternatives and complementary repositories for FPGA-Recommendation-Accelerator
- ☆15 Updated 3 years ago
- FleetRec: Large-Scale Recommendation Inference on Hybrid GPU-FPGA Clusters ☆15 Updated 3 years ago
- ☆25 Updated 3 years ago
- agile hardware-software co-design ☆46 Updated 2 years ago
- A reference implementation of the Mind Mappings Framework. ☆28 Updated 2 years ago
- ☆61 Updated 3 years ago
- ☆23 Updated 3 years ago
- EQueue Dialect ☆39 Updated 2 years ago
- Linux docker for the DNN accelerator exploration infrastructure composed of Accelergy and Timeloop ☆46 Updated 2 weeks ago
- mNPUsim: A Cycle-accurate Multi-core NPU Simulator (IISWC 2023) ☆38 Updated 6 months ago
- ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference ☆68 Updated last week
- ☆22 Updated last month
- [FPGA'21] Microbenchmarks for Demystifying the Memory System of Modern Datacenter FPGAs for Software Programmers ☆29 Updated 2 years ago
- PAAS: A System Level Simulator for Heterogeneous (CPU-FPGA) Computing Systems ☆43 Updated 3 years ago
- STONNE Simulator integrated into SST Simulator ☆17 Updated 7 months ago
- HeteroCL-MLIR dialect for accelerator design ☆40 Updated 2 months ago
- A PIM instrumentation, compilation, execution, simulation, and evaluation repository for BLIMP-style architectures. ☆16 Updated 2 years ago
- Domain-Specific Architecture Generator 2 ☆20 Updated 2 years ago
- Polyhedral High-Level Synthesis in MLIR ☆29 Updated last year
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM. ☆23 Updated last week
- The source code for GPGPUSim+Ramulator simulator. In this version, GPGPUSim uses Ramulator to simulate the DRAM. This simulator is used t… ☆49 Updated 5 years ago
- ☆33 Updated 3 years ago
- HeteroHalide: From Image Processing DSL to Efficient FPGA Acceleration ☆15 Updated 4 years ago
- High Bandwidth Memory (HBM) timing model based on DRAMSim2 ☆41 Updated 7 years ago
- PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization ☆25 Updated 9 months ago
- Implementations of Buffets, which are efficient, composable idioms for implementing Explicit Decoupled Data Orchestration. ☆64 Updated 5 years ago
- Stencil with Optimized Dataflow Architecture Compiler ☆16 Updated 4 years ago
- Heterogeneous ML accelerator ☆16 Updated last month
- ☆10 Updated 8 months ago
- A Toy-Purpose TPU Simulator ☆10 Updated 5 months ago