ECoLab-POSTECH/NIPQ

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ECoLab-POSTECH/NIPQ)

ECoLab-POSTECH / NIPQ

☆18

Alternatives and similar repositories for NIPQ

Users that are interested in NIPQ are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xvyaward / owq
View on GitHub
Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…
☆72Mar 7, 2024Updated 2 years ago
htqin / QuantSR
View on GitHub
[NeurIPS 2023 Spotlight] This project is the official implementation of our accepted NeurIPS 2023 (spotlight) paper QuantSR: Accurate Low…
☆53May 13, 2024Updated 2 years ago
aiha-lab / MX-QLLM
View on GitHub
LLM Inference with Microscaling Format
☆35Nov 12, 2024Updated last year
YoungHyun197 / ptq4vm
View on GitHub
ptq4vm official repository
☆28Apr 7, 2025Updated last year
YujieLu10 / Seeker
View on GitHub
☆11May 24, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
gilshm / sparq
View on GitHub
Post-training sparsity-aware quantization
☆34Feb 26, 2023Updated 3 years ago
nbasyl / OFQ
View on GitHub
The official implementation of the ICML 2023 paper OFQ-ViT
☆39Oct 3, 2023Updated 2 years ago
Cheeun / DAQ-pytorch
View on GitHub
[WACV2022] Official Code for the "DAQ: Channel-Wise Distribution-Aware Quantization for Deep Image Super-Resolution Networks"
☆27Feb 19, 2024Updated 2 years ago
ChengZhang-98 / QERA
View on GitHub
Official implementation of the ICLR'25 paper "QERA: an Analytical Framework for Quantization Error Reconstruction".
☆14Feb 4, 2025Updated last year
souravsanyal06 / DNN-Dataflow-simulator
View on GitHub
Implementation of Input Stationary, Weight Stationary and Output Stationary dataflow for given neural network on a tiled architecture
☆10Apr 19, 2020Updated 6 years ago
zhengchen3 / HLS_Transformer
View on GitHub
c++ version of ViT
☆12Nov 13, 2022Updated 3 years ago
shihuihong214 / P2-ViT
View on GitHub
☆13Jun 4, 2024Updated 2 years ago
PositionalHidden / PositionalHidden
View on GitHub
To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …
☆12Jun 18, 2024Updated 2 years ago
wimh966 / outlier_suppression
View on GitHub
The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L…
☆49Oct 5, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
DravenALG / ReSTE
View on GitHub
(ICCV 2023) Official implementation of Rectified Straight Through Estimator (ReSTE).
☆34Sep 20, 2024Updated last year
ayushiagg / ELASTICO
View on GitHub
Implementation of elastico protocol
☆13Jun 12, 2019Updated 7 years ago
cmpark0126 / pytorch-LARS
View on GitHub
☆10Jul 14, 2019Updated 7 years ago
huangyuxiang03 / Locret
View on GitHub
☆14Oct 3, 2024Updated last year
CASR-HKU / DPACS
View on GitHub
☆19Mar 21, 2023Updated 3 years ago
jjxxmiin / Network_Trimming_Pytorch
View on GitHub
Implementation network trimming using pytorch
☆15Apr 20, 2020Updated 6 years ago
Spiritator / FPGA_LeNet5_ws_8x8
View on GitHub
FPGA implement of 8x8 weight stationary systolic array DNN accelerator
☆18Feb 27, 2021Updated 5 years ago
1157942086 / CVPR2020_Auxiliary_Quantization
View on GitHub
Training Quantized Neural Networks with a Full-precision Auxiliary Module
☆13Jun 19, 2020Updated 6 years ago
SivannaKing / SEU-ASIC-IOT-ECGAI
View on GitHub
Arrhythmia Detection Using Algorithm and Hardware Co-design for Neural Network Inference Accelerators
☆16Jun 5, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
blockchain-lab / ScaleOutDistributedLedger
View on GitHub
TU Delft Blockchain Engineering course project on scale-out distributed ledger
☆13Mar 4, 2018Updated 8 years ago
HaoranREN / TensorFlow_Model_Quantization
View on GitHub
A tutorial of model quantization using TensorFlow
☆11Aug 2, 2021Updated 4 years ago
antofuller / lookwhere
View on GitHub
Official repo of LookWhere (NeurIPS 2025) for efficient high-res visual recognition
☆16Oct 23, 2025Updated 8 months ago
RobertCsordas / switchhead
View on GitHub
☆16Jun 11, 2025Updated last year
ciki000 / DID
View on GitHub
☆14Dec 17, 2023Updated 2 years ago
linkinpark213 / quantization-networks-cifar10
View on GitHub
A re-implementation of the CVPR19 paper Quantization Networks on CIFAR-10, MNIST and ImageNet
☆10Aug 9, 2020Updated 5 years ago
magronp / phase-madtwinnet
View on GitHub
Code for phase recovery in MadTwinNet for monaural singing voice separation
☆12Jul 17, 2018Updated 8 years ago
vedant-jumle / Mamba-tf
View on GitHub
Open source implementation of the Mamba architecture in TensorFlow
☆20Jul 15, 2024Updated 2 years ago
AsahiLiu / PointDetectron
View on GitHub
☆19Nov 27, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
hughperkins / torch-modder-notes
View on GitHub
Notes for torch maintainers/modders
☆10Mar 29, 2016Updated 10 years ago
chengquan / IC_FLOW
View on GitHub
☆22Oct 29, 2025Updated 8 months ago
UIUC-ChenLab / Chrysalis-HLS
View on GitHub
☆17Aug 29, 2024Updated last year
bingoe1010 / FamilyGuard
View on GitHub
Taurus AI & Pegasus ,Mixpose-short
☆12May 7, 2023Updated 3 years ago
IEEE-AICAS / AICAS2025_GC
View on GitHub
☆19Apr 23, 2025Updated last year
robertoBosio / nn2FPGA
View on GitHub
nn2FPGA converts ONNX models into FPGA dataflow accelerators with seamless ONNX Runtime integration.
☆21Jul 7, 2026Updated 2 weeks ago
Zhu-ZiXuan / Bitlet-PE
View on GitHub
A bit-level sparsity-awared multiply-accumulate process element.
☆19Jul 9, 2024Updated 2 years ago