ollycassidy13 / CompressedLUT
A tool to generate optimized hardware files for univariate functions.
☆14Updated 5 months ago
Alternatives and similar repositories for CompressedLUT:
Users that are interested in CompressedLUT are comparing it to the libraries listed below
- Implementation of Microscaling data formats in SystemVerilog.☆14Updated 5 months ago
- DOSA: Differentiable Model-Based One-Loop Search for DNN Accelerators☆13Updated 4 months ago
- [FPGA 2024] Source code and bitstream for LevelST: Stream-based Accelerator for Sparse Triangular Solver☆11Updated last year
- An Automatic Synthesis Tool for PIM-based CNN Accelerators.☆11Updated 11 months ago
- Official implementation of EMNLP'23 paper "Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?"☆19Updated last year
- ☆14Updated 2 years ago
- Compiler coursework repository for Instruction Architectures and Compilers module at Imperial College London☆20Updated 3 weeks ago
- Benchmark framework of compute-in-memory based accelerators for deep neural network (on-chip training chip focused)☆134Updated 11 months ago
- some knowleage about SystemC/TLM etc.☆17Updated last year
- An extention of pytorch for low precision training / inference☆9Updated last year
- A Fast DNN Accelerator Design Space Exploration Framework.☆45Updated 2 years ago
- ☆12Updated 10 months ago
- (Verilog) A simple convolution layer implementation with systolic array structure☆12Updated 2 years ago
- ☆17Updated 2 months ago
- Here are some implementations of basic hardware units in RTL language (verilog for now), which can be used for area/power evaluation and …☆10Updated last year
- ☆13Updated 5 months ago
- Multi-core HW accelerator mapping optimization framework for layer-fused ML workloads.☆48Updated 3 weeks ago
- CNN hardware accelerator to accelerate quantized LeNet-5 model☆28Updated last year
- AMD University Program HLS tutorial☆78Updated 3 months ago
- An efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences☆24Updated 11 months ago
- a general-purpose machine learning-driven auto-tuner for heterogeneous platforms☆10Updated 5 months ago
- [FPGA 2024]FPGA Accelerator for Imbalanced SpMV using HLS☆8Updated last week
- A co-design architecture on sparse attention☆51Updated 3 years ago
- Tender: Accelerating Large Language Models via Tensor Decompostion and Runtime Requantization (ISCA'24)☆13Updated 7 months ago
- Performance and resource models for fpgaConvNet: a Streaming-Architecture-based CNN Accelerator.☆28Updated 3 months ago
- ☆20Updated 6 months ago