kssteven418 / Q-ASRLinks

[ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition

☆34

Alternatives and similar repositories for Q-ASR

Users that are interested in Q-ASR are comparing it to the libraries listed below

Sorting:

vineeths96 / Compressed-Transformers
In this repository, we explore model compression for transformer architectures via quantization. We specifically explore quantization awa…
☆24Updated 4 years ago
htqin / BiFSMNv2
Pytorch implementation of BiFSMNv2, TNNLS 2023
☆31Updated 2 years ago
Qualcomm-AI-research / transformer-quantization
☆205Updated 3 years ago
danpovey / quantization
Torch-based tool for quantizing high-dimensional vectors using additive codebooks
☆54Updated 3 years ago
glory20h / FitHuBERT
FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning (INTERSPEECH 2022)
☆17Updated last year
k2-fsa / multi_quantization
☆44Updated last year
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆31Updated 3 years ago
dobby-seo / Pytorch-MHAtt-RNN-KWS
Multi-Head-Attention RNN pytorch implement for keyword spotting
☆21Updated 4 years ago
huckiyang / Interspeech23-Tutorial-Para-Efficient-Cross-Modal-Tutorial
Interspeech Tutorial - Resource Efficient and Cross-Modal Learning Toward Foundation Modeling
☆15Updated last year
renyuanL / ry-Speech-commands
☆19Updated 5 years ago
Jangho-Kim / PSG-pytorch
Position-based Scaled Gradient for Model Quantization and Pruning Code (NeurIPS 2020)
☆26Updated 4 years ago
wentaozhu / speechnas
SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification
☆30Updated 2 years ago
htqin / BiFSMN
Pytorch implementation of BiFSMN, IJCAI 2022
☆21Updated 2 years ago
mrusci / ondevice-learning-kws
Test Framework for few-shot open set KWS
☆32Updated 9 months ago
titu1994 / warprnnt_numba
WarpRNNT loss ported in Numba CPU/CUDA for Pytorch
☆17Updated 3 years ago
jundaf2 / INT8-Flash-Attention-FMHA-Quantization
☆158Updated last year
TeaPoly / CTC-OptimizedLoss
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
☆58Updated last year
mechanicalsea / lighthubert
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
☆74Updated 2 years ago
swagshaw / TorchKWS
Collection of PyTorch implementations of Spoken Keyword Spotting presented in research papers.
☆27Updated last year
tigert1998 / qat
Manually implemented quantization-aware training
☆21Updated 2 years ago
papers-submission / CalibTIP
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
☆36Updated 2 years ago
mrusci / training-mixed-precision-quantized-networks
This repository containts the pytorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr…
☆50Updated last year
gilshm / sparq
Post-training sparsity-aware quantization
☆34Updated 2 years ago
shincling / discreteSeparation
The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".
☆12Updated 3 years ago
hfutami / distill-bert-for-seq2seq-asr
☆24Updated 5 years ago
nvidia-riva / riva-asrlib-decoder
Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva
☆91Updated 5 months ago
csukuangfj / kaldi-hmm-gmm
☆25Updated 9 months ago
skmhrk1209 / QuanTorch
PyTorch implementation of "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"
☆57Updated 5 years ago
facebookresearch / gtn_applications
Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"
☆84Updated 3 years ago
GATECH-EIC / S3-Router
[NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…
☆17Updated last year