pyf98 / speech-model-compression
A collection of papers related to speech model compression
☆24 · Updated last year
Related projects
Alternatives and complementary repositories for speech-model-compression
- Multipurpose Multi Speaker Mixture Signal Generator ☆43 · Updated last month
- Clustering-based methods for overlapping diarization ☆71 · Updated 10 months ago
- CMU multilingual speech repository ☆31 · Updated 2 years ago
- ConMamba for Automatic Speech Recognition ☆45 · Updated 3 months ago
- A CSRankings-like index for speech researchers ☆31 · Updated last month
- A list of papers for child ASR ☆26 · Updated last month
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training ☆38 · Updated 3 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS ☆27 · Updated last year
- End-to-end diarization loss ☆22 · Updated 3 years ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin… ☆44 · Updated last year
- ☆36 · Updated 2 years ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models ☆35 · Updated last month
- ☆17 · Updated last week
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition" ☆17 · Updated 2 years ago
- ☆25 · Updated 3 weeks ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS" ☆23 · Updated last year
- Python wrappers for Kaldi Levenshtein's distance and alignment code. ☆61 · Updated 8 months ago
- Pronunciation-assisted Subword Modeling ☆29 · Updated 5 years ago
- ☆27 · Updated last year
- Python wrapper for OpenFST and its extensions from Kaldi. Also supports reading/writing ark/scp files ☆47 · Updated 4 months ago
- ☆51 · Updated last week
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z… ☆30 · Updated 2 years ago
- multilingual speech aligner ☆72 · Updated last year
- ☆16 · Updated 2 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT ☆38 · Updated 2 months ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch ☆16 · Updated 2 years ago
- ☆31 · Updated last year
- ARCH: Audio Representations benCHmark ☆38 · Updated 2 months ago
- ☆55 · Updated 3 years ago