In this repository, we explore model compression for transformer architectures via quantization. Specifically, we apply quantization-aware training to the linear layers and demonstrate performance at 8-bit, 4-bit, 2-bit, and 1-bit (binary) precision.
☆24 · May 14, 2021 · Updated 5 years ago
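As a rough illustration of what quantizing linear-layer weights to 8, 4, 2, or 1 bit can look like, here is a minimal sketch in plain Python. It is not the repository's code: the names `fake_quantize` and `quantized_linear` are hypothetical, and the symmetric per-tensor scale and the sign-times-mean-magnitude binarization are assumptions chosen for simplicity. In quantization-aware training, a forward pass like this is usually paired with a straight-through estimator for the gradients, which the sketch leaves out.

```python
from typing import List


def fake_quantize(weights: List[float], bits: int) -> List[float]:
    """Quantize weights to `bits` bits, then map them back to floats.

    Hypothetical helper for illustration; not taken from the repository.
    """
    if bits < 1:
        raise ValueError("bits must be >= 1")
    max_abs = max(abs(w) for w in weights)
    if max_abs == 0.0:
        return [0.0 for _ in weights]
    if bits == 1:
        # Binary case (assumed scheme): keep the sign, scale by mean magnitude.
        scale = sum(abs(w) for w in weights) / len(weights)
        return [scale if w >= 0 else -scale for w in weights]
    # Symmetric uniform grid with integer levels in [-qmax, qmax].
    qmax = 2 ** (bits - 1) - 1
    scale = max_abs / qmax
    return [max(-qmax, min(qmax, round(w / scale))) * scale for w in weights]


def quantized_linear(x: List[float], weights: List[float],
                     bias: float, bits: int) -> float:
    """Forward pass of a single output unit using fake-quantized weights."""
    w_q = fake_quantize(weights, bits)
    return sum(xi * wi for xi, wi in zip(x, w_q)) + bias


weights = [0.9, -0.4, 0.05, -1.0]
for bits in (8, 4, 2, 1):
    # Lower bit widths collapse the weights onto fewer distinct values.
    print(bits, fake_quantize(weights, bits))
```

Because the quantized weights are mapped back to floats, the rest of the network can stay in floating point during training while still seeing the rounding error it would face at inference time.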
Alternatives and similar repositories for Compressed-Transformers
Users that are interested in Compressed-Transformers are comparing it to the libraries listed below.
- This repository contains the hardware implementation for Static BFP convolution on FPGA ☆10 · Oct 15, 2019 · Updated 6 years ago
- ☆14 · Oct 24, 2022 · Updated 3 years ago
- bitfusion verilog implementation ☆13 · Feb 21, 2022 · Updated 4 years ago
- C++ package for learning optimal wavelet bases using a neural network approach. ☆14 · Dec 2, 2016 · Updated 9 years ago
- ☆49 · Jan 21, 2022 · Updated 4 years ago
- Systolic-array based Deep Learning Accelerator generator ☆29 · Dec 11, 2020 · Updated 5 years ago
- DeiT implementation for Q-ViT ☆26 · Apr 21, 2025 · Updated last year
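- Reproduces the point cloud segmentation task based on Point Transformers and applies the HAQ algorithm for automatic quantization compression, with almost no loss of accuracy ☆26 · Aug 25, 2022 · Updated 3 years ago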
- Post-Training Quantization for Vision transformers. ☆242 · Jul 19, 2022 · Updated 3 years ago
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer" ☆45 · Oct 30, 2025 · Updated 7 months ago
- SSR: Spatial Sequential Hybrid Architecture for Latency Throughput Tradeoff in Transformer Acceleration (Full Paper Accepted in FPGA'24) ☆36 · Mar 12, 2026 · Updated 3 months ago
- ☆29 · Dec 5, 2023 · Updated 2 years ago
- ☆12 · Jun 4, 2024 · Updated 2 years ago
- HLS Custom-Precision Floating-Point Library ☆13 · Nov 6, 2017 · Updated 8 years ago
- Download and create a tfreader for the audioset dataset ☆17 · Apr 16, 2020 · Updated 6 years ago
- Generate an FPGA design for a TWN ☆11 · Nov 4, 2019 · Updated 6 years ago
- A guide on how to package HDL code (VHDL or Verilog) for PYNQ environments ☆11 · Aug 14, 2025 · Updated 9 months ago
- Comparing Audio Features for Unsupervised Sound Classification ☆10 · Jun 22, 2022 · Updated 3 years ago
- HLS project modeling various sparse accelerators. ☆12 · Jan 11, 2022 · Updated 4 years ago
- Downloadable prebuilt AppImages for KiCad ☆14 · Sep 14, 2019 · Updated 6 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 · Feb 13, 2021 · Updated 5 years ago
- PyTorch implementation of Towards Efficient Training for Neural Network Quantization ☆16 · Jan 16, 2020 · Updated 6 years ago
- An HTTP server library in C++ ☆16 · Jan 10, 2019 · Updated 7 years ago
- PyTorch implementation of the paper: A Global-local Attention Framework for Weakly Labelled Audio Tagging. ☆13 · Feb 6, 2021 · Updated 5 years ago
- Classify modulation of signals ☆16 · Jan 16, 2020 · Updated 6 years ago
- All homework for the Embedded Microprocessor System course at Peking University ☆10 · Dec 28, 2021 · Updated 4 years ago
- LLM4HWDesign Starting Toolkit ☆19 · Oct 4, 2024 · Updated last year
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning. ☆23 · Nov 23, 2018 · Updated 7 years ago
- Unsupervised speech activity detection system. ☆11 · Jul 2, 2018 · Updated 7 years ago
- Code for our paper "Exploring Bit-Slice Sparsity in Deep Neural Networks for Efficient ReRAM-Based Deployment" [NeurIPS'19 EMC2 workshop]… ☆10 · Oct 12, 2020 · Updated 5 years ago
- superfast text to speech in any voice ☆62 · Feb 16, 2026 · Updated 3 months ago
- This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.… ☆11 · Feb 4, 2020 · Updated 6 years ago
- Official code for Wav2Seq ☆97 · Jul 19, 2022 · Updated 3 years ago
- Official implementation of EMNLP'23 paper "Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?" ☆24 · Oct 25, 2023 · Updated 2 years ago
- ☆30 · Apr 29, 2026 · Updated last month
- The source code for target sound detection ☆15 · Feb 26, 2022 · Updated 4 years ago
- ☆17 · Jul 16, 2020 · Updated 5 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR ☆11 · Oct 21, 2022 · Updated 3 years ago
- Open Source Projects from Pallas Lab ☆21 · Oct 10, 2021 · Updated 4 years ago