zhexinli/Q-ViT-DeiT

DeiT implementation for Q-ViT

☆26

Alternatives and similar repositories for Q-ViT-DeiT

Users interested in Q-ViT-DeiT are comparing it to the repositories listed below.

wimh966 / outlier_suppression
The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L…
☆49 · Oct 5, 2022 · Updated 3 years ago
YanjingLi0202 / Q-ViT
The official implementation of the NeurIPS 2022 paper Q-ViT.
☆106 · May 22, 2023 · Updated 3 years ago
os-hxfan / Static_BFP_HW
This repository contains the hardware implementation for Static BFP convolution on FPGA
☆10 · Oct 15, 2019 · Updated 6 years ago
hustzxd / LSQuantization
The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)
☆139 · Nov 19, 2020 · Updated 5 years ago
zhutmost / lsq-net
Unofficial implementation of LSQ-Net, a neural network quantization framework
☆316 · May 8, 2024 · Updated 2 years ago
YanjingLi0202 / Bi-ViT
The official implementation of the AAAI 2024 paper Bi-ViT.
☆13 · Dec 18, 2023 · Updated 2 years ago
megvii-research / FQ-ViT
[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer
☆360 · Apr 11, 2023 · Updated 3 years ago
jakc4103 / scale-adjusted-training
PyTorch implementation of Towards Efficient Training for Neural Network Quantization
☆16 · Jan 16, 2020 · Updated 6 years ago
ZLKong / Tri-Level-ViT
[AAAI 2023 Oral] Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training
☆14 · Apr 19, 2023 · Updated 3 years ago
MXHX7199 / ICCV_2021_AFP
AFP is a hardware-friendly quantization framework for DNNs, contributed by Fangxin Liu and Wenbo Zhao.
☆13 · Nov 8, 2021 · Updated 4 years ago
GATECH-EIC / Auto-NBA
[ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…
☆16 · Jan 3, 2022 · Updated 4 years ago
Andrew-Tierno / QuantizedTransformer
Implementation of a Quantized Transformer Model
☆20 · Mar 20, 2019 · Updated 7 years ago
GATECH-EIC / ViTCoD
[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
☆133 · Jun 27, 2023 · Updated 3 years ago
vineeths96 / Compressed-Transformers
In this repository, we explore model compression for transformer architectures via quantization. We specifically explore quantization awa…
☆24 · May 14, 2021 · Updated 5 years ago
zkkli / PSAQ-ViT
[ECCV 2022] Patch Similarity Aware Data-Free Quantization for Vision Transformers
☆124 · Dec 22, 2022 · Updated 3 years ago
1hunters / LIMPQ
Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance"
☆62 · Mar 19, 2023 · Updated 3 years ago
hahnyuan / PTQ4ViT
Post-Training Quantization for Vision Transformers.
☆245 · Jul 19, 2022 · Updated 4 years ago
liuzechun / Nonuniform-to-Uniform-Quantization
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.
☆139 · Apr 28, 2022 · Updated 4 years ago
Qualcomm-AI-research / transformer-quantization
☆212 · Nov 9, 2021 · Updated 4 years ago
yanghr / BSQ
BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)
☆41 · Jan 12, 2021 · Updated 5 years ago
PannenetsF / TQT
TQT's PyTorch implementation.
☆22 · Dec 17, 2021 · Updated 4 years ago
SteveTsui / Q-DETR
☆38 · Sep 3, 2023 · Updated 2 years ago
ZiweiWangTHU / Quantformer
This is the official PyTorch implementation for the paper: Quantformer: Learning Extremely Low-precision Vision Transformers.
☆31 · Nov 14, 2022 · Updated 3 years ago
Markin-Wang / CLEViT
[IJCAI 2023] CLE-ViT: Contrastive Learning Encoded Transformer for Ultra-Fine-Grained Visual Categorization.
☆10 · Nov 3, 2023 · Updated 2 years ago
harvard-acc / FlexASR
FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks
☆52 · May 20, 2026 · Updated 2 months ago
enyac-group / MaxEVA
MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine (accepted as full paper at FPT'23)
☆22 · Apr 17, 2024 · Updated 2 years ago
hatsu3 / Sanger
☆48 · Aug 23, 2021 · Updated 4 years ago
PeiyanFlying / SPViT
☆53 · Aug 28, 2024 · Updated last year
IST-DASLab / gemm-fp8
High Performance FP8 GEMM Kernels for SM89 and later GPUs.
☆21 · Jan 24, 2025 · Updated last year
facebookresearch / bit
Code repo for the paper "BiT: Robustly Binarized Multi-distilled Transformer"
☆115 · Jun 26, 2023 · Updated 3 years ago
YujieLu10 / Seeker
☆11 · May 24, 2024 · Updated 2 years ago
GATECH-EIC / DNN-Chip-Predictor
[ICASSP'20] DNN-Chip Predictor: An Analytical Performance Predictor for DNN Accelerators with Various Dataflows and Hardware Architecture…
☆23 · Oct 1, 2022 · Updated 3 years ago
bohanzhuang / Towards-Effective-Low-bitwidth-Convolutional-Neural-Networks
This repository implements the paper "Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations"
☆20 · Aug 30, 2021 · Updated 4 years ago
nzjin / awesome_moe
A collection of MoE (Mixture of Experts) papers, code, and tools.
☆12 · Mar 15, 2024 · Updated 2 years ago
dicecco1 / fpga_cpfp
HLS Custom-Precision Floating-Point Library
☆13 · Nov 6, 2017 · Updated 8 years ago
EnnengYang / An-Efficient-Dataset-Condensation-Plugin
An Efficient Dataset Condensation Plugin and Its Application to Continual Learning. NeurIPS, 2023.
☆12 · Nov 29, 2023 · Updated 2 years ago
Thinklab-SJTU / twns
☆23 · Jan 19, 2023 · Updated 3 years ago
jerry-D / HedgeHog-Fused-Spiking-Neural-Network-Emulator-Compute-Engine
HedgeHog Fused Spiking Neural Network Emulator/Compute Engine is a hardware implementation of an SNN designed for implementation in Xilinx…
☆62 · Feb 10, 2026 · Updated 5 months ago
Qualcomm-AI-research / BayesianBits
☆22 · Feb 11, 2022 · Updated 4 years ago