In this repository, we explore model compression for transformer architectures via quantization. We specifically explore quantization aware training of the linear layers and demonstrate the performance for 8 bits, 4 bits, 2 bits and 1 bit (binary) quantization.
☆24May 14, 2021Updated 4 years ago
Alternatives and similar repositories for Compressed-Transformers
Users that are interested in Compressed-Transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains the hardware implementation for Static BFP convolution on FPGA☆10Oct 15, 2019Updated 6 years ago
- ☆14Oct 24, 2022Updated 3 years ago
- bitfusion verilog implementation☆13Feb 21, 2022Updated 4 years ago
- C++ package for learning optimal wavelet bases using a neural network approach.☆14Dec 2, 2016Updated 9 years ago
- ☆49Jan 21, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Systolic-array based Deep Learning Accelerator generator☆29Dec 11, 2020Updated 5 years ago
- DeiT implementation for Q-ViT☆25Apr 21, 2025Updated last year
- Post-Training Quantization for Vision transformers.☆242Jul 19, 2022Updated 3 years ago
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"☆45Oct 30, 2025Updated 6 months ago
- SSR: Spatial Sequential Hybrid Architecture for Latency Throughput Tradeoff in Transformer Acceleration (Full Paper Accepted in FPGA'24)☆35Mar 12, 2026Updated last month
- HLS Custom-Precision Floating-Point Library☆13Nov 6, 2017Updated 8 years ago
- Download and create a tfreader for the audioset dataset☆17Apr 16, 2020Updated 6 years ago
- A guide on how to package HDL code (VHDL or Verilog) for PYNQ environments☆11Aug 14, 2025Updated 8 months ago
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- superfast text to speech in any voice☆62Feb 16, 2026Updated 2 months ago
- PyTorch implementation of Towards Efficient Training for Neural Network Quantization☆16Jan 16, 2020Updated 6 years ago
- [AAAI 2025] Official data and code for "TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances"☆15Sep 11, 2025Updated 7 months ago
- An HTTP server library in C++☆16Jan 10, 2019Updated 7 years ago
- ☆11Aug 2, 2024Updated last year
- Classify modulation of signals☆16Jan 16, 2020Updated 6 years ago
- ☆19Jan 13, 2022Updated 4 years ago
- LLM4HWDesign Starting Toolkit☆19Oct 4, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Official code for Wav2Seq☆97Jul 19, 2022Updated 3 years ago
- The source code for target sound detection☆15Feb 26, 2022Updated 4 years ago
- ☆30Updated this week
- The project is related to the development of labs for the ITMO Speaker Recognition Course.☆16Apr 1, 2026Updated last month
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- This repository contains code that was used as an example of how to use Python to download part of the AudioSet dataset and use Tensorflo…☆13Aug 24, 2017Updated 8 years ago
- ☆13Jul 2, 2016Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Open Source Projects from Pallas Lab☆21Oct 10, 2021Updated 4 years ago
- Quantize pytorch model, support post-training quantization and quantization aware training methods☆14Jun 15, 2023Updated 2 years ago
- Facebook SAM3例程☆46Apr 7, 2026Updated 3 weeks ago
- A linear array of PEs with RISC-V ISA targeting extreme high frequency on Xilinx ZYNQ Ultrascale+, specificially for applications such as…☆13Jun 4, 2024Updated last year
- Material for the class "Testing, debugging, profiling -- Python tools for building software"☆14Nov 7, 2025Updated 5 months ago
- ☆33May 17, 2024Updated last year
- ECNU Compiler Project: x0 compiler, implemented by lex and yacc(bison)☆11Mar 10, 2026Updated last month