kssteven418 / Q-ASR
[ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition
☆31 Updated 3 years ago
Alternatives and similar repositories for Q-ASR:
Users who are interested in Q-ASR are comparing it to the libraries listed below.
- In this repository, we explore model compression for transformer architectures via quantization. We specifically explore quantization awa… ☆22 Updated 3 years ago
- PyTorch implementation of BiFSMN, IJCAI 2022 ☆21 Updated last year
- FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning (INTERSPEECH 2022) ☆17 Updated last year
- This repository contains the PyTorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr… ☆49 Updated 8 months ago
- PyTorch implementation of BiFSMNv2, TNNLS 2023 ☆30 Updated last year
- ☆43 Updated last year
- TernGEMM: General Matrix Multiply Library with Ternary Weights for Fast DNN Inference ☆13 Updated 2 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT ☆70 Updated 2 years ago
- Manually implemented quantization-aware training ☆21 Updated 2 years ago
- ☆197 Updated 3 years ago
- Position-based Scaled Gradient for Model Quantization and Pruning Code (NeurIPS 2020) ☆26 Updated 4 years ago
- ☆53 Updated last year
- ☆19 Updated 5 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming ☆34 Updated last year
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss. ☆57 Updated last year
- Implementation of RNN transducer ☆21 Updated 5 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z… ☆30 Updated 2 years ago
- Multi-head attention RNN PyTorch implementation for keyword spotting ☆21 Updated 4 years ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization ☆94 Updated 2 years ago
- Test Framework for few-shot open set KWS ☆25 Updated 2 months ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva ☆83 Updated last month
- Memory efficient transducer loss computation ☆68 Updated 2 years ago
- PyTorch implementation of "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference" ☆55 Updated 5 years ago
- Collection of PyTorch implementations of Spoken Keyword Spotting presented in research papers. ☆22 Updated 9 months ago
- WarpRNNT loss ported in Numba CPU/CUDA for PyTorch ☆16 Updated 2 years ago
- PyTorch Quantization Aware Training Example ☆127 Updated 8 months ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena. ☆43 Updated 2 years ago
- [ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa… ☆32 Updated last year
- Post-training sparsity-aware quantization ☆34 Updated last year
- Rainbow Keywords - Official PyTorch Implementation ☆12 Updated 7 months ago