aiha-lab / AI-thermometer
☆9 · Updated 2 years ago
Alternatives and similar repositories for AI-thermometer:
Users who are interested in AI-thermometer are comparing it to the repositories listed below:
- TernGEMM: General Matrix Multiply Library with Ternary Weights for Fast DNN Inference ☆13 · Updated 3 years ago
- ☆95 · Updated last year
- ☆26 · Updated this week
- Repository to host and maintain scale-sim-v2 code ☆289 · Updated this week
- [HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design ☆105 · Updated last year
- Official implementation of EMNLP'23 paper "Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?" ☆19 · Updated last year
- A co-design architecture on sparse attention ☆52 · Updated 3 years ago
- Accelergy is an energy estimation infrastructure for accelerator energy estimations ☆136 · Updated 2 months ago
- An efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences ☆26 · Updated last year
- An open-source parameterizable NPU generator with full-stack multi-target compilation stack for intelligent workloads. ☆50 · Updated last month
- SDA: Low-Bit Stable Diffusion Acceleration on Edge FPGAs ☆17 · Updated 11 months ago
- Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime Requantization (ISCA'24) ☆14 · Updated 9 months ago
- Multi-core HW accelerator mapping optimization framework for layer-fused ML workloads. ☆51 · Updated 2 months ago
- ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference ☆111 · Updated 2 months ago
- List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals. ☆84 · Updated 10 months ago
- FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations ☆91 · Updated 3 years ago
- ☆29 · Updated 4 months ago
- STONNE: A Simulation Tool for Neural Networks Engines ☆130 · Updated 10 months ago
- Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures. ☆378 · Updated 3 weeks ago
- MICRO22 artifact evaluation for Sparseloop ☆43 · Updated 2 years ago
- A framework for fast exploration of the depth-first scheduling space for DNN accelerators ☆38 · Updated 2 years ago
- The codes and artifacts associated with our MICRO'22 paper titled: "Adaptable Butterfly Accelerator for Attention-based NNs via Hardware … ☆128 · Updated last year
- ☆56 · Updated 3 weeks ago
- [TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers ☆40 · Updated last year
- AFP is a hardware-friendly quantization framework for DNNs, which is contributed by Fangxin Liu and Wenbo Zhao. ☆12 · Updated 3 years ago
- ☆13 · Updated 2 years ago
- ☆43 · Updated 2 years ago
- Open-source of MSD framework ☆16 · Updated last year
- ☆18 · Updated 3 years ago
- Linux docker for the DNN accelerator exploration infrastructure composed of Accelergy and Timeloop ☆52 · Updated last week