csukuangfj / kaldi-hmm-gmm
☆25Updated 5 months ago
Alternatives and similar repositories for kaldi-hmm-gmm:
Users who are interested in kaldi-hmm-gmm are comparing it to the libraries listed below.
- A simple command line tool to calculate WER for ASR.☆14Updated 5 months ago
- ☆43Updated last year
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated last month
- Python wrapper for OpenFST and its extensions from Kaldi. Also supports reading/writing ark/scp files☆50Updated 8 months ago
- Python wrapper for Kaldi's native I/O☆27Updated 2 months ago
- Python wrapper for Kaldi's arpa2fst☆38Updated 3 months ago
- Decoders from Kaldi using OpenFst☆27Updated 2 months ago
- PyTorch implementation of Continuous Speech Separation☆13Updated 2 years ago
- Production first, nn-based on-device signal processing toolkit.☆64Updated last year
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- ☆26Updated last year
- This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR).☆24Updated 3 months ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Updated 5 years ago
- faster inference☆27Updated 2 months ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆28Updated 2 years ago
- ☆30Updated last year
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆21Updated last year
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated 2 years ago
- RepVgg + HiFiGAN☆34Updated 2 years ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and quickly.☆16Updated 2 years ago
- Spherical residual vector quantization (SRVQ)☆28Updated 7 months ago
- PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…☆25Updated 3 years ago
- ☆16Updated 2 years ago
- ☆16Updated 3 months ago
- (WIP) long-form speech generation☆30Updated this week
- Efficient Neural Architecture Search via Straight-Through Gradients☆13Updated 4 years ago
- ICASSP 2025: Dynamic Embedding Causal Target Speech Extraction☆2Updated 2 weeks ago
- Objective metrics used in several text-to-speech (TTS) papers.☆48Updated 2 years ago
- ☆14Updated 2 years ago