kssteven418 / Q-ASR
[ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition
☆31Updated 3 years ago
Alternatives and similar repositories for Q-ASR
Users who are interested in Q-ASR are comparing it to the repositories listed below
- In this repository, we explore model compression for transformer architectures via quantization. We specifically explore quantization awa…☆24Updated 4 years ago
- PyTorch implementation of BiFSMN, IJCAI 2022☆21Updated 2 years ago
- Torch-based tool for quantizing high-dimensional vectors using additive codebooks☆54Updated 2 years ago
- FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning (INTERSPEECH 2022)☆17Updated last year
- ☆43Updated last year
- PyTorch implementation of BiFSMNv2, TNNLS 2023☆30Updated 2 years ago
- This repository contains the PyTorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr…☆49Updated last year
- Test Framework for few-shot open set KWS☆31Updated 6 months ago
- PyTorch implementation of a multi-head attention RNN for keyword spotting☆21Updated 4 years ago
- TernGEMM: General Matrix Multiply Library with Ternary Weights for Fast DNN Inference☆13Updated 3 years ago
- Position-based Scaled Gradient for Model Quantization and Pruning Code (NeurIPS 2020)☆26Updated 4 years ago
- ☆62Updated last year
- ☆19Updated 5 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆88Updated 2 months ago
- SpeechNAS: Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification☆30Updated 2 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆57Updated last year
- Collection of PyTorch implementations of Spoken Keyword Spotting presented in research papers.☆25Updated last year
- Post-training sparsity-aware quantization☆34Updated 2 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆30Updated 3 years ago
- Memory efficient transducer loss computation☆68Updated 2 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Updated 2 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆35Updated last year
- An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.☆68Updated 4 years ago
- Broadcasted Residual Learning for Efficient Keyword Spotting☆23Updated 3 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆73Updated 2 years ago
- [ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa…☆32Updated 2 years ago
- ☆17Updated 2 years ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization☆94Updated 3 years ago
- Rainbow Keywords - Official PyTorch Implementation☆13Updated 10 months ago
- Compressed version of Tacotron 2 using Tensor Train + Waveglow.☆22Updated 5 years ago