Appen / UHV-OTS-SpeechView external linksLinks
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
☆106Mar 25, 2023Updated 2 years ago
Alternatives and similar repositories for UHV-OTS-Speech
Users that are interested in UHV-OTS-Speech are comparing it to the libraries listed below
Sorting:
- Python wrapper for kaldi's arpa2fst☆38Aug 27, 2025Updated 5 months ago
- Torch-based tool for quantizing high-dimensional vectors using additive codebooks☆54May 25, 2022Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- ☆16Apr 4, 2022Updated 3 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- ☆32Jul 27, 2022Updated 3 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- ☆10Apr 8, 2024Updated last year
- ☆11Nov 5, 2021Updated 4 years ago
- Multistream CNN for Robust Acoustic Modeling☆40Jun 17, 2021Updated 4 years ago
- Filter Bank Implementaion as Convolutional Neural Network using Python Keras☆17Dec 18, 2024Updated last year
- Memory efficient transducer loss computation☆69Jun 10, 2022Updated 3 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- Moved to https://github.com/k2-fsa/icefall☆146Oct 13, 2022Updated 3 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆212May 30, 2025Updated 8 months ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- The People’s Speech Dataset☆113Jan 11, 2024Updated 2 years ago
- An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.☆169Jan 7, 2026Updated last month
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Jul 14, 2022Updated 3 years ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Nov 8, 2018Updated 7 years ago
- Large, modern dataset for speech recognition☆719Feb 26, 2024Updated last year
- ☆37Nov 22, 2025Updated 2 months ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44May 9, 2023Updated 2 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆345May 15, 2024Updated last year
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Jun 12, 2023Updated 2 years ago
- ☆197May 3, 2024Updated last year
- multilingual speech aligner☆76Nov 19, 2023Updated 2 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- A library for speech data augmentation in time-domain☆682Aug 30, 2021Updated 4 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Oct 8, 2021Updated 4 years ago
- Support tools for punctuation and boundary detection for ASR output.☆55Dec 8, 2022Updated 3 years ago
- Grapheme to phoneme conversion with deep learning.☆419Dec 8, 2023Updated 2 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- Multilingual G2P in 100 languages☆374May 26, 2023Updated 2 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆74Oct 9, 2020Updated 5 years ago