k2-fsa / multi_quantization
☆43Updated last year
Alternatives and similar repositories for multi_quantization:
Users that are interested in multi_quantization are comparing it to the libraries listed below
- ☆25Updated 4 months ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆50Updated 8 months ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆28Updated 2 years ago
- Production first, nn-based on-device signal processing toolkit.☆64Updated last year
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated last month
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆26Updated 3 months ago
- ☆57Updated 4 years ago
- Clustering-based methods for overlapping diarization☆77Updated last year
- A simple package for Guided source separation (GSS)☆117Updated 10 months ago
- Python wrapper for kaldi's arpa2fst☆38Updated 3 months ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆45Updated 9 months ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆13Updated 2 years ago
- Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…☆21Updated 5 months ago
- ☆30Updated last year
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 5 years ago
- ☆33Updated 3 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆57Updated last year
- ☆68Updated 2 years ago
- Speech (audio) subjective evaluation system☆38Updated 4 years ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆54Updated last year
- An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".☆25Updated last year
- ☆64Updated last year
- PyTorch implementation of Continuous Speech Separation☆13Updated 2 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆69Updated 2 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Updated 2 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- Implementation of CTC alignment-based single step non-autoregressive transformer☆13Updated last year
- Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement☆76Updated 3 years ago
- ☆26Updated last year