jeffreyyu0602 / quantized-trainingLinks

☆32

Alternatives and similar repositories for quantized-training

Users that are interested in quantized-training are comparing it to the libraries listed below

Sorting:

clevercool / ANT-Quantization
☆112Updated 2 years ago
pku-liang / Sanger
A co-design architecture on sparse attention
☆53Updated 4 years ago
sjtu-zhao-lab / SALO
An efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences
☆30Updated last year
hatsu3 / Sanger
☆47Updated 4 years ago
mit-han-lab / spatten
[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
☆114Updated last year
snu-comparch / Tender
Tender: Accelerating Large Language Models via Tensor Decompostion and Runtime Requantization (ISCA'24)
☆21Updated last year
jha-lab / acceltran
[TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers
☆54Updated last year
leesou / H2-LLM-ISCA-2025
H2-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference
☆76Updated 6 months ago
ChengZhang-98 / llm-mixed-q
Official implementation of EMNLP'23 paper "Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?"
☆24Updated 2 years ago
CLab-HKUST-GZ / micro58-axcore
☆21Updated 3 weeks ago
GATECH-EIC / ViTCoD
[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
☆123Updated 2 years ago
Accelergy-Project / micro22-sparseloop-artifact
MICRO22 artifact evaluation for Sparseloop
☆44Updated 3 years ago
actlab-genesys / GeneSys
An open-source parameterizable NPU generator with full-stack multi-target compilation stack for intelligent workloads.
☆69Updated last month
SET-Scheduling-Project / SoMa-HPCA2025
☆24Updated 8 months ago
ebby-s / MX-for-FPGA
Implementation of Microscaling data formats in SystemVerilog.
☆27Updated 4 months ago
SET-Scheduling-Project / GEMINI-HPCA2024
Open-source Framework for HPCA2024 paper: Gemini: Mapping and Architecture Co-exploration for Large-scale DNN Chiplet Accelerators
☆102Updated 6 months ago
isakedo / DNNsim
☆35Updated 5 years ago
Zhu-Zixuan / Bitlet-PE
A bit-level sparsity-awared multiply-accumulate process element.
☆18Updated last year
CASR-HKU / MSD-FCCM23
Open-source of MSD framework
☆16Updated 2 years ago
KULeuven-MICAS / stream
Multi-core HW accelerator mapping optimization framework for layer-fused ML workloads.
☆64Updated 4 months ago
Zhaoshixin-sky / CIM-MLC
[ASPLOS 2024] CIM-MLC: A Multi-level Compilation Stack for Computing-In-Memory Accelerators
☆48Updated last year
GATECH-EIC / ViTALiTy
ViTALiTy (HPCA'23) Code Repository
☆23Updated 2 years ago
SFU-HiAccel / HiSpMV
[TRETS 2025][FPGA 2024] FPGA Accelerator for Imbalanced SpMV using HLS
☆16Updated 2 months ago
kelvin0207 / SparSynergy
Open source RTL implementation of Tensor Core, Sparse Tensor Core, BitWave and SparSynergy in the article: "SparSynergy: Unlocking Flexib…
☆20Updated 7 months ago
scalesim-project / scale-sim-v3
☆49Updated 3 months ago
witmemtech / CIM-Technical-Papers-Collection
Computing in memory optimizes data handling by performing operations directly in memory, ideal for high-speed data processing needs. This…
☆28Updated 11 months ago
KULeuven-MICAS / DeFiNES
A framework for fast exploration of the depth-first scheduling space for DNN accelerators
☆41Updated 2 years ago
pku-liang / TENET
An analytical framework that models hardware dataflow of tensor applications on spatial architectures using the relation-centric notation…
☆87Updated last year
hsharma35 / bitfusion
Simulator for BitFusion
☆102Updated 5 years ago
fangjh21 / PALM
PALM: A Efficient Performance Simulator for Tiled Accelerators with Large-scale Model Training
☆19Updated last year