Appen/UHV-OTS-Speech

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

☆106

Alternatives and similar repositories for UHV-OTS-Speech

Users who are interested in UHV-OTS-Speech are comparing it to the repositories listed below.

csukuangfj / kaldilm
Python wrapper for kaldi's arpa2fst
☆38 · Aug 27, 2025 · Updated 9 months ago
danpovey / quantization
Torch-based tool for quantizing high-dimensional vectors using additive codebooks
☆54 · May 25, 2022 · Updated 4 years ago
desh2608 / kaldi-noise-vectors
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13 · Feb 13, 2021 · Updated 5 years ago
charlesliucn / LanMIT
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
☆22 · Jul 12, 2019 · Updated 6 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
☆11 · Nov 5, 2021 · Updated 4 years ago
TUIlmenauAMS / FilterBanks_PythonKerasNeuralNetworkImplemention
Filter Bank Implementation as Convolutional Neural Network using Python Keras
☆17 · Dec 18, 2024 · Updated last year
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆32 · Apr 8, 2022 · Updated 4 years ago
facebookresearch / vocoder-benchmark
A repository for benchmarking neural vocoders by their quality and speed.
☆212 · May 30, 2025 · Updated last year
MiniXC / LightningFastSpeech2
☆55 · Jan 13, 2023 · Updated 3 years ago
k2-fsa / snowfall
Moved to https://github.com/k2-fsa/icefall
☆146 · Oct 13, 2022 · Updated 3 years ago
SpeechColab / GigaSpeech
Large, modern dataset for speech recognition
☆725 · Feb 26, 2024 · Updated 2 years ago
speechio / asr-noises
A handy dataset of noises for ASR
☆22 · May 29, 2019 · Updated 7 years ago
revdotcom / fstalign
An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.
☆170 · May 12, 2026 · Updated 2 weeks ago
asappresearch / multistream-cnn
Multistream CNN for Robust Acoustic Modeling
☆40 · Jun 17, 2021 · Updated 4 years ago
csukuangfj / optimized_transducer
Memory efficient transducer loss computation
☆70 · Jun 10, 2022 · Updated 3 years ago
mlcommons / peoples-speech
The People’s Speech Dataset
☆113 · Jan 11, 2024 · Updated 2 years ago
Aria-K-Alethia / speaking-rate-controllable-hifi-gan
☆16 · Apr 4, 2022 · Updated 4 years ago
k2-fsa / kaldi-decoder
Decoders from Kaldi using OpenFst
☆36 · Apr 10, 2026 · Updated last month
xinjli / alqalign
multilingual speech aligner
☆77 · Nov 19, 2023 · Updated 2 years ago
alumae / torch-xvectors-wav
☆22 · Jun 30, 2021 · Updated 4 years ago
alumae / streaming-punctuator
☆17 · Apr 14, 2023 · Updated 3 years ago
rishikksh20 / Avocodo-pytorch
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
☆122 · Jul 14, 2022 · Updated 3 years ago
facebookresearch / WavAugment
A library for speech data augmentation in time-domain
☆687 · Aug 30, 2021 · Updated 4 years ago
BiometricVox / DAE_SpeakerID
Denoising autoencoders for speaker identification on MCE 2018 challenge
☆12 · Nov 8, 2018 · Updated 7 years ago
rishikksh20 / PPSpeech
PPSpeech: Phrase based Parallel End-to-End TTS System
☆35 · Aug 31, 2020 · Updated 5 years ago
cadia-lvl / punctuation-prediction
Support tools for punctuation and boundary detection for ASR output.
☆55 · Dec 8, 2022 · Updated 3 years ago
RuABraun / texterrors
☆37 · Apr 16, 2026 · Updated last month
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
Using YouTube to prepare a speech recognition dataset for any language
☆10 · Mar 30, 2021 · Updated 5 years ago
tencent-ailab / pika
a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
☆354 · Dec 25, 2020 · Updated 5 years ago
awslabs / speech-representations
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
☆104 · Nov 26, 2022 · Updated 3 years ago
lumaku / ctc-segmentation
Segment an audio file and obtain utterance alignments. (Python package)
☆347 · May 15, 2024 · Updated 2 years ago
yoyolicoris / variational-diffwave
☆32 · Jul 27, 2022 · Updated 3 years ago
csukuangfj / kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…
☆213 · Aug 7, 2025 · Updated 9 months ago
lingjzhu / CharsiuG2P
Multilingual G2P in 100 languages
☆384 · May 26, 2023 · Updated 3 years ago
farisalasmary / wav2vec2-kenlm
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆75 · Oct 11, 2021 · Updated 4 years ago
axelspringer / DeepPhonemizer
Grapheme to phoneme conversion with deep learning.
☆425 · Dec 8, 2023 · Updated 2 years ago
aalto-speech / subword-kaldi
Properly handle position-dependent phones in a subword lexicon FST
☆31 · Oct 26, 2020 · Updated 5 years ago
YatingMusic / ddsp-singing-vocoders
Official implementation of SawSing (ISMIR'22)
☆275 · Aug 28, 2022 · Updated 3 years ago
danpovey / pocolm
Small language toolkit for creation, interpolation and pruning of ARPA language models
☆92 · Aug 6, 2022 · Updated 3 years ago