ChengZhang-98/QERA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ChengZhang-98/QERA)

ChengZhang-98 / QERA

Official implementation of the ICLR'25 paper "QERA: an Analytical Framework for Quantization Error Reconstruction".

☆14

Alternatives and similar repositories for QERA

Users that are interested in QERA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ChengZhang-98 / LQER
View on GitHub
Official implementation of ICML'24 paper "LQER: Low-Rank Quantization Error Reconstruction for LLMs"
☆19Jul 11, 2024Updated 2 years ago
fmp453 / few-shot-erasing
View on GitHub
[BMVC2024] Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning
☆14Jul 22, 2026Updated last week
P-bibs / Lobster
View on GitHub
Lobster: A GPU-Accelerated Framework for Neurosymbolic Programming
☆17Mar 26, 2026Updated 4 months ago
facebookresearch / flowception
View on GitHub
Authors implementation of "Flowception Temporally Expansive Flow Matching for Video Generation".
☆21May 9, 2026Updated 2 months ago
spcl / spatial-collectives
View on GitHub
Optimized communication collectives for the Cerebras waferscale engine
☆17Jun 5, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
BrotherHappy / OSTQuant
View on GitHub
[ICLR2025]: OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitt…
☆94Apr 8, 2025Updated last year
souravsanyal06 / DNN-Dataflow-simulator
View on GitHub
Implementation of Input Stationary, Weight Stationary and Output Stationary dataflow for given neural network on a tiled architecture
☆10Apr 19, 2020Updated 6 years ago
zhengchen3 / HLS_Transformer
View on GitHub
c++ version of ViT
☆12Nov 13, 2022Updated 3 years ago
shihuihong214 / P2-ViT
View on GitHub
☆13Jun 4, 2024Updated 2 years ago
edwar-vhd / SFU-Piecewise-Polynomial-Approximation
View on GitHub
Special Function Units (SFUs) are hardware accelerators, their implementation helps improve the performance of GPUs to process some of th…
☆17Sep 21, 2025Updated 10 months ago
Fiwo735 / Transformer_Neural_Network_HLS
View on GitHub
☆14Jun 22, 2022Updated 4 years ago
cjerzak / causalimages-software
View on GitHub
[CLeaR 2023, 2025] causalimages: An R package for performing causal inference with image and image sequence data
☆27Jun 7, 2026Updated last month
yushuiwx / MH-MoE
View on GitHub
☆20Nov 5, 2024Updated last year
chandar-lab / EfficientLLMs
View on GitHub
☆22Jul 30, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
joseph-nagel / diffusion-demo
View on GitHub
PyTorch denoising diffusion demo
☆21Jul 16, 2026Updated last week
CASR-HKU / DPACS
View on GitHub
☆20Mar 21, 2023Updated 3 years ago
jjxxmiin / Network_Trimming_Pytorch
View on GitHub
Implementation network trimming using pytorch
☆15Apr 20, 2020Updated 6 years ago
Paramathic / slim
View on GitHub
SLiM: One-shot Quantized Sparse Plus Low-rank Approximation of LLMs (ICML 2025)
☆37Nov 28, 2025Updated 8 months ago
dynodroid / dynodroid
View on GitHub
Automatic Input Generation System for Android Apps
☆37Nov 24, 2019Updated 6 years ago
Spiritator / FPGA_LeNet5_ws_8x8
View on GitHub
FPGA implement of 8x8 weight stationary systolic array DNN accelerator
☆18Feb 27, 2021Updated 5 years ago
kxh001 / Info-Decomp
View on GitHub
Interpretable Diffusion Via Information Decomposition
☆29Jul 18, 2024Updated 2 years ago
VincentWang1998 / ai_on_chip_project1
View on GitHub
tpu-systolic-array-weight-stationary
☆25May 7, 2021Updated 5 years ago
SivannaKing / SEU-ASIC-IOT-ECGAI
View on GitHub
Arrhythmia Detection Using Algorithm and Hardware Co-design for Neural Network Inference Accelerators
☆16Jun 5, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
antofuller / lookwhere
View on GitHub
Official repo of LookWhere (NeurIPS 2025) for efficient high-res visual recognition
☆17Oct 23, 2025Updated 9 months ago
KID-22 / LLM-Unlearning-Paper-List
View on GitHub
☆28Dec 18, 2025Updated 7 months ago
ECoLab-POSTECH / NIPQ
View on GitHub
☆18Jul 1, 2023Updated 3 years ago
chengquan / IC_FLOW
View on GitHub
☆22Oct 29, 2025Updated 9 months ago
UIUC-ChenLab / Chrysalis-HLS
View on GitHub
☆17Aug 29, 2024Updated last year
bingoe1010 / FamilyGuard
View on GitHub
Taurus AI & Pegasus ,Mixpose-short
☆12May 7, 2023Updated 3 years ago
IEEE-AICAS / AICAS2025_GC
View on GitHub
☆19Apr 23, 2025Updated last year
xiaohangt / wd1
View on GitHub
Official Implementation of wd1
☆32Sep 25, 2025Updated 10 months ago
CASR-HKU / MSD-FCCM23
View on GitHub
Open-source of MSD framework
☆16Sep 12, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ChengyueWang / ISCA25-Stream-Network-Arch
View on GitHub
Reconfigurable Stream Network Architecture
☆18May 8, 2025Updated last year
Verdvana / Verdvana.github.io
View on GitHub
Verdvana‘s Blog
☆23Updated this week
GuoqingWang1 / Awesome-dLLM-Papers
View on GitHub
☆20Mar 11, 2026Updated 4 months ago
robertoBosio / nn2FPGA
View on GitHub
nn2FPGA converts ONNX models into FPGA dataflow accelerators with seamless ONNX Runtime integration.
☆21Jul 21, 2026Updated last week
Cerebras / sdk-examples
View on GitHub
☆48Apr 27, 2026Updated 3 months ago
Zhu-ZiXuan / Bitlet-PE
View on GitHub
A bit-level sparsity-awared multiply-accumulate process element.
☆19Jul 9, 2024Updated 2 years ago
parsa-epfl / quantization-sparsity-interplay
View on GitHub
This repo contains the code for studying the interplay between quantization and sparsity methods
☆26Feb 26, 2025Updated last year