sarahjuan / ibanView external linksLinks
☆14Jun 12, 2015Updated 10 years ago
Alternatives and similar repositories for iban
Users that are interested in iban are comparing it to the libraries listed below
Sorting:
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Evaluation of STT models for german language☆15Jan 22, 2022Updated 4 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Dec 4, 2023Updated 2 years ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit☆16Feb 20, 2016Updated 9 years ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 3 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Mar 6, 2023Updated 2 years ago
- ☆17Jun 30, 2020Updated 5 years ago
- Example workflow for our data-centric speech benchmark☆17Jul 6, 2023Updated 2 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆20Jul 21, 2020Updated 5 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- ☆22Jun 30, 2021Updated 4 years ago
- MirasVoice is a data set consisting speech samples from bilinguals to train neural network for optimization of speaker verification algor…☆19Mar 15, 2020Updated 5 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Dec 21, 2021Updated 4 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆19Jan 23, 2022Updated 4 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 3 years ago
- Support tools for punctuation and boundary detection for ASR output.☆55Dec 8, 2022Updated 3 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Jan 26, 2020Updated 6 years ago
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Dec 12, 2019Updated 6 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- ☆14Mar 15, 2022Updated 3 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- Grapheme to phoneme model for PyTorch☆43Jul 21, 2022Updated 3 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 9 years ago
- Baidu's CTC Decoders, including Greedy, Beam Search and Beam Search with KenLM Language Model☆24Oct 28, 2023Updated 2 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- A database of number names for 186 languages, locales, and scripts☆67Mar 3, 2023Updated 2 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago
- transcribe audio feeds into public web ui☆45Aug 31, 2022Updated 3 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean☆23Dec 17, 2017Updated 8 years ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Jun 12, 2023Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago