Implementation of a Quantized Transformer Model
☆19Mar 20, 2019Updated 6 years ago
Alternatives and similar repositories for QuantizedTransformer
Users that are interested in QuantizedTransformer are comparing it to the libraries listed below
Sorting:
- PyTorch code for full quantization of DNN using BCGD☆14Jul 24, 2019Updated 6 years ago
- ☆13Mar 27, 2023Updated 2 years ago
- Implementation of several knowledge distillation techniques on PyTorch☆15Feb 25, 2019Updated 7 years ago
- ☆13Nov 7, 2021Updated 4 years ago
- AFP is a hardware-friendly quantization framework for DNNs, which is contributed by Fangxin Liu and Wenbo Zhao.☆13Nov 8, 2021Updated 4 years ago
- An 8bit automated quantization conversion tool for the pytorch (Post-training quantization based on KL divergence)☆32Nov 17, 2019Updated 6 years ago
- 🔮 LLM GPU Calculator☆21Aug 19, 2023Updated 2 years ago
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)☆42Jan 12, 2021Updated 5 years ago
- This is the pytorch implementation for the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is…☆24Aug 17, 2021Updated 4 years ago
- [KDD'22] Learned Token Pruning for Transformers☆101Feb 27, 2023Updated 3 years ago
- Proximal Mean-field for Neural Network Quantization☆21Apr 9, 2020Updated 5 years ago
- DL quantization for pytorch☆26Mar 30, 2019Updated 6 years ago
- Vision Longformer For Object Detection☆34May 17, 2021Updated 4 years ago
- ☆36Sep 3, 2023Updated 2 years ago
- Notch filtering using ofxCv☆10May 17, 2021Updated 4 years ago
- Personal collection of references for high performance mixed precision training.☆41Oct 21, 2019Updated 6 years ago
- A pytorch implementation of DoReFa-Net☆132Dec 26, 2019Updated 6 years ago
- TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"☆36Dec 17, 2021Updated 4 years ago
- Implementation of Sparse Shift Layer and Active Shift Layer (3D, 4D, 5D tensors) for PyTorch(CPU,GPU)☆35May 5, 2021Updated 4 years ago
- Static Block Floating Point Quantization for CNN☆37Jun 9, 2021Updated 4 years ago
- This repository represents training examples for the CVPR 2018 paper "SYQ:Learning Symmetric Quantization For Efficient Deep Neural Netwo…☆31Jul 25, 2019Updated 6 years ago
- ☆11Feb 18, 2022Updated 4 years ago
- ☆16Updated this week
- Bluespec SystemVerilog library for use of the IBM Coherent Accelerator-Processor Interface (CAPI)☆11May 25, 2016Updated 9 years ago
- ☆11Apr 8, 2024Updated last year
- SKFAC Preconditioner for MindSpore☆12Jul 2, 2021Updated 4 years ago
- ☆12Jan 23, 2024Updated 2 years ago
- The official re-implementation of the Neurips 2021 paper, "Targeted Neural Dynamical Modeling".☆10Mar 4, 2022Updated 3 years ago
- Baidu 100G Chasiss Switch hardware spec☆12Sep 20, 2017Updated 8 years ago
- [NeurIPS'21] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang…☆89Dec 1, 2023Updated 2 years ago
- Contains code for Binary, Ternary, N-bit Quantized and Hybrid CNNs for low precision experiments.☆26Oct 30, 2018Updated 7 years ago
- Structural RNN using PyTorch☆41Oct 16, 2017Updated 8 years ago
- ☆12Sep 24, 2024Updated last year
- AI wiki☆10Dec 9, 2022Updated 3 years ago
- This repository contains the hardware implementation for Static BFP convolution on FPGA☆10Oct 15, 2019Updated 6 years ago
- Codes for our paper "Exploring Bit-Slice Sparsity in Deep Neural Networks for Efficient ReRAM-Based Deployment" [NeurIPS'19 EMC2 workshop]…☆10Oct 12, 2020Updated 5 years ago
- ☆13Oct 26, 2023Updated 2 years ago
- ActiveHARNet: Towards On-Device Deep Bayesian Active Learning for Human Activity Recognition☆12Nov 7, 2020Updated 5 years ago
- 팡요랩 자료☆11May 31, 2019Updated 6 years ago