htqin/BiBERT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/htqin/BiBERT)

htqin / BiBERT

This project is the official implementation of our accepted ICLR 2022 paper BiBERT: Accurate Fully Binarized BERT.

☆89

Alternatives and similar repositories for BiBERT

Users that are interested in BiBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookresearch / bit
View on GitHub
Code repo for the paper BiT Robustly Binarized Multi-distilled Transformer
☆114Jun 26, 2023Updated 2 years ago
htqin / IR-Net
View on GitHub
[CVPR 2020] This project is the PyTorch implementation of our accepted CVPR 2020 paper : forward and backward information retention for a…
☆180Mar 14, 2020Updated 6 years ago
htqin / BiPointNet
View on GitHub
This project is the official implementation of our accepted ICLR 2021 paper BiPointNet: Binary Neural Network for Point Clouds.
☆77Mar 1, 2021Updated 5 years ago
ThisisBillhe / BiViT
View on GitHub
The official implementation of BiViT: Extremely Compressed Binary Vision Transformers
☆16Jun 18, 2023Updated 2 years ago
htqin / BiBench
View on GitHub
[ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binar…
☆56Mar 4, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Phuoc-Hoan-Le / BinaryViT
View on GitHub
BinaryViT: Pushing Binary Vision Transformers Towards Convolutional Models
☆38Feb 4, 2024Updated 2 years ago
htqin / BiFSMN
View on GitHub
Pytorch implementation of BiFSMN, IJCAI 2022
☆22Feb 10, 2023Updated 3 years ago
bywmm / Bi-GCN
View on GitHub
Implementation of "Binary Graph Convolutional Network", CVPR 2021, and TPAMI 2024.
☆26Apr 8, 2024Updated 2 years ago
facebookresearch / Ternary_Binary_Transformer
View on GitHub
ACL 2023
☆39Jun 6, 2023Updated 2 years ago
liuzechun / ReActNet
View on GitHub
ReActNet: Towards Precise Binary NeuralNetwork with Generalized Activation Functions. In ECCV 2020.
☆265Nov 11, 2021Updated 4 years ago
csyhhu / MetaQuant
View on GitHub
Codes for Accepted Paper : "MetaQuant: Learning to Quantize by Learning to Penetrate Non-differentiable Quantization" in NeurIPS 2019
☆54May 8, 2020Updated 5 years ago
Efficient-ML / Awesome-Model-Quantization
View on GitHub
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are co…
☆2,360Apr 25, 2026Updated last week
itayhubara / CalibTIP
View on GitHub
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
☆98Jun 10, 2021Updated 4 years ago
ziplab / QTool
View on GitHub
Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com)
☆73Oct 7, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
huqinghao / PalQuant
View on GitHub
☆12Aug 26, 2022Updated 3 years ago
SteveTsui / ReBNN
View on GitHub
☆12Nov 17, 2023Updated 2 years ago
Efficient-ML / Awesome-Efficient-AIGC
View on GitHub
A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including languag…
☆205Feb 10, 2025Updated last year
billhhh / FQSR
View on GitHub
Codes for ACMMM 2021 paper "Fully Quantized Image Super-Resolution Networks".
☆19Jul 25, 2021Updated 4 years ago
chrundle / biprop
View on GitHub
Identify a binary weight or binary weight and activation subnetwork within a randomly initialized network by only pruning and binarizing …
☆51Feb 24, 2022Updated 4 years ago
cvlab-yonsei / EWGS
View on GitHub
An official implementation of "Network Quantization with Element-wise Gradient Scaling" (CVPR 2021) in PyTorch.
☆95Jul 14, 2023Updated 2 years ago
jakc4103 / scale-adjusted-training
View on GitHub
PyTorch implementation of Towards Efficient Training for Neural Network Quantization
☆16Jan 16, 2020Updated 6 years ago
peiswang / BitSplit
View on GitHub
BitSplit Post-trining Quantization
☆49Dec 20, 2021Updated 4 years ago
deJQK / AdaBits
View on GitHub
☆42Dec 15, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
zzzxxxttt / pytorch_DoReFaNet
View on GitHub
A pytorch implementation of DoReFa-Net
☆132Dec 26, 2019Updated 6 years ago
hahnyuan / PB-LLM
View on GitHub
PB-LLM: Partially Binarized Large Language Models
☆155Nov 20, 2023Updated 2 years ago
yhhhli / BRECQ
View on GitHub
Pytorch implementation of BRECQ, ICLR 2021
☆297Aug 1, 2021Updated 4 years ago
htqin / IR-QLoRA
View on GitHub
[ICML 2024 Oral] This project is the official implementation of our Accurate LoRA-Finetuning Quantization of LLMs via Information Retenti…
☆65Apr 15, 2024Updated 2 years ago
SqueezeAILab / SqueezeLLM
View on GitHub
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
☆718Aug 13, 2024Updated last year
submission2019 / AnalyticalScaleForIntegerQuantization
View on GitHub
Example for applying Gaussian and Laplace clipping on activations of CNN.
☆34Jan 20, 2019Updated 7 years ago
htqin / DSG
View on GitHub
This project is the official implementation of our accepted IEEE TPAMI paper Diverse Sample Generation: Pushing the Limit of Data-free Qu…
☆15Feb 26, 2023Updated 3 years ago
Aaronhuang-778 / BiLLM
View on GitHub
[ICML 2024] BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
☆229Jan 11, 2025Updated last year
hahnyuan / PTQ4ViT
View on GitHub
Post-Training Quantization for Vision transformers.
☆242Jul 19, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
aim-uofa / model-quantization
View on GitHub
Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com)
☆45Aug 19, 2021Updated 4 years ago
allenbai01 / ProxQuant
View on GitHub
ProxQuant: Quantized Neural Networks via Proximal Operators
☆30Feb 19, 2019Updated 7 years ago
htqin / QuantSR
View on GitHub
[NeurIPS 2023 Spotlight] This project is the official implementation of our accepted NeurIPS 2023 (spotlight) paper QuantSR: Accurate Low…
☆51May 13, 2024Updated last year
Zhen-Dong / BitPack
View on GitHub
BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.
☆58Feb 7, 2023Updated 3 years ago
Cornell-RelaxML / QuIP
View on GitHub
Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees"
☆397Feb 24, 2024Updated 2 years ago
liuzechun / Bi-Real-net
View on GitHub
Bi-Real Net: Enhancing the Performance of 1-bit CNNs With Improved Representational Capability and Advanced Training Algorithm. In ECCV 2…
☆186Mar 28, 2021Updated 5 years ago
huawei-noah / Pretrained-Language-Model
View on GitHub
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
☆3,160Jan 22, 2024Updated 2 years ago