kssteven418 / Q-ASRLinks
[ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition
☆33Updated 3 years ago
Alternatives and similar repositories for Q-ASR
Users that are interested in Q-ASR are comparing it to the libraries listed below
Sorting:
- In this repository, we explore model compression for transformer architectures via quantization. We specifically explore quantization awa…☆24Updated 4 years ago
- ☆44Updated last year
- Pytorch implementation of BiFSMN, IJCAI 2022☆21Updated 2 years ago
- Pytorch implementation of BiFSMNv2, TNNLS 2023☆31Updated 2 years ago
- This repository containts the pytorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr…☆50Updated last year
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆58Updated last year
- Position-based Scaled Gradient for Model Quantization and Pruning Code (NeurIPS 2020)☆26Updated 4 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆31Updated 3 years ago
- Collection of PyTorch implementations of Spoken Keyword Spotting presented in research papers.☆25Updated last year
- Torch-based tool for quantizing high-dimensional vectors using additive codebooks☆54Updated 3 years ago
- FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning (INTERSPEECH 2022)☆17Updated last year
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆36Updated last year
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆74Updated 2 years ago
- Manually implemented quantization-aware training☆21Updated 2 years ago
- Test Framework for few-shot open set KWS☆31Updated 7 months ago
- ☆19Updated 5 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Updated 4 months ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Updated 3 years ago
- ☆205Updated 3 years ago
- TernGEMM: General Matrix Multiply Library with Ternary Weights for Fast DNN Inference☆13Updated 3 years ago
- Memory efficient transducer loss computation☆68Updated 3 years ago
- BitSplit Post-trining Quantization☆50Updated 3 years ago
- Multi-Head-Attention RNN pytorch implement for keyword spotting☆21Updated 4 years ago
- [ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa…☆33Updated 2 years ago
- A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp/pp.☆74Updated 3 weeks ago
- Implementation of Google's USM speech model in Pytorch☆31Updated 2 months ago
- ☆17Updated 2 years ago
- Post-training sparsity-aware quantization☆34Updated 2 years ago
- ☆16Updated 6 months ago
- Rainbow Keywords - Official PyTorch Implementation☆13Updated last year