aiha-lab / AI-thermometerLinks

☆9

Alternatives and similar repositories for AI-thermometer

Users that are interested in AI-thermometer are comparing it to the libraries listed below

Sorting:

aiha-lab / TernGEMM
TernGEMM: General Matrix Multiply Library with Ternary Weights for Fast DNN Inference
☆13Updated 3 years ago
clevercool / ANT-Quantization
☆98Updated last year
aiha-lab / Attention-Head-Pruning
Layer-wise Pruning of Transformer Heads for Efficient Language Modeling
☆21Updated 3 years ago
scalesim-project / scale-sim-v2
Repository to host and maintain scale-sim-v2 code
☆300Updated last month
jeffreyyu0602 / quantized-training
☆27Updated this week
pku-liang / Sanger
A co-design architecture on sparse attention
☆52Updated 3 years ago
GATECH-EIC / ViTCoD
[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
☆107Updated last year
snu-comparch / Tender
Tender: Accelerating Large Language Models via Tensor Decompostion and Runtime Requantization (ISCA'24)
☆14Updated 11 months ago
PrincetonUniversity / LLMCompass
☆148Updated 11 months ago
pku-liang / TENET
An analytical framework that models hardware dataflow of tensor applications on spatial architectures using the relation-centric notation…
☆85Updated last year
sjtu-zhao-lab / SALO
An efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences
☆27Updated last year
PSAL-POSTECH / ONNXim
ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference
☆120Updated 3 months ago
ECASLab / hls-fpga-accelerators
Collection of kernel accelerators optimised for LLM execution
☆17Updated 2 months ago
cornell-zhang / FracBNN
FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations
☆94Updated 3 years ago
microsoft / microxcaling
PyTorch emulation library for Microscaling (MX)-compatible data formats
☆241Updated last week
DD-DuDa / awesome-vit-quantization-acceleration
List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.
☆90Updated last year
casys-kaist / NeuPIMs
NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing
☆83Updated 11 months ago
hsharma35 / bitfusion
Simulator for BitFusion
☆100Updated 4 years ago
chiragsakhuja / spotlight
☆16Updated 2 years ago
tgrogers / ece695-2021
Programming and Assignment Material for ECE 695
☆15Updated 4 years ago
abdelfattah-lab / BitMoD-HPCA-25
☆41Updated 5 months ago
KULeuven-MICAS / DeFiNES
A framework for fast exploration of the depth-first scheduling space for DNN accelerators
☆39Updated 2 years ago
sefaburakokcu / quantized-yolov5
Low Precision(quantized) Yolov5
☆38Updated 2 months ago
Accelergy-Project / accelergy
Accelergy is an energy estimation infrastructure for accelerator energy estimations
☆138Updated last week
jha-lab / acceltran
[TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers
☆44Updated last year
SET-Scheduling-Project / SET-ISCA2023
The framework for the paper "Inter-layer Scheduling Space Definition and Exploration for Tiled Accelerators" in ISCA 2023.
☆67Updated 2 months ago
fangjh21 / PALM
PALM: A Efficient Performance Simulator for Tiled Accelerators with Large-scale Model Training
☆16Updated 11 months ago
albertomarchisio / SwiftTron
☆44Updated 2 years ago
actlab-genesys / GeneSys
An open-source parameterizable NPU generator with full-stack multi-target compilation stack for intelligent workloads.
☆53Updated 2 months ago
mean9park / BitFusion-verilog
bitfusion verilog implementation
☆8Updated 3 years ago