talonvoice / wav2trainView external linksLinks
automatically align transcribed audio and generate a wav2letter training corpus
☆36Apr 11, 2023Updated 2 years ago
Alternatives and similar repositories for wav2train
Users that are interested in wav2train are comparing it to the libraries listed below
Sorting:
- Facebook AI Research Automatic Speech Recognition Toolkit☆23Mar 13, 2021Updated 4 years ago
- DeepSpeech based forced alignment tool☆239Dec 12, 2020Updated 5 years ago
- A fully convolution-network for speech-to-text, built on pytorch.☆126May 20, 2020Updated 5 years ago
- Professor forcing future code☆10Sep 22, 2018Updated 7 years ago
- Common Voice Generator using Speech Synthesizer☆13Jul 28, 2021Updated 4 years ago
- A set of impulse response files used for convolution-based encoders and decoders☆15Feb 26, 2022Updated 3 years ago
- PyTorch implementation of Data-Efficient Image Recognition with Contrastive Predictive Coding☆13Feb 26, 2020Updated 5 years ago
- OHTI Open Head Tracking Initiative☆15Mar 1, 2023Updated 2 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- speech engine training projects☆29Apr 19, 2021Updated 4 years ago
- Toward Scalable Neural Dialogue State Tracking Model☆20Sep 23, 2022Updated 3 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Jun 6, 2021Updated 4 years ago
- A webpage and API for using Mozilla DeepSpeech☆48Feb 24, 2021Updated 4 years ago
- ☆87Feb 9, 2022Updated 4 years ago
- ☆57Apr 18, 2023Updated 2 years ago
- A TTS Trained on Universal Audio.☆41Jun 6, 2025Updated 8 months ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆26Jul 16, 2021Updated 4 years ago
- Binaural EBU ADM Renderer☆26Jan 24, 2025Updated last year
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 2 years ago
- ☆37Sep 21, 2025Updated 4 months ago
- Da - ECHO - RetrievAl - daTasEt☆34Jul 7, 2024Updated last year
- SuperCollider class for GUI-assisted authoring of dynamic ambisonic sound fields.☆27Feb 5, 2026Updated last week
- Gecko - A Tool for Effective Annotation of Human Conversations☆301Dec 1, 2025Updated 2 months ago
- Blitzing Fast CTC Beam Search Decoder☆185Oct 27, 2025Updated 3 months ago
- Add motion-based magic to your React Native apps! ThinkSys Mediapipe Plugin offers real-time pose detection for iOS, with easy integratio…☆32Jan 19, 2026Updated 3 weeks ago
- PoC port of JUCE to the browser via emscripten☆35Dec 8, 2014Updated 11 years ago
- An auralisation system that takes a head-worn microphone array recordings as input and renders the audio for binaural playback; taking in…☆35Oct 10, 2023Updated 2 years ago
- A Fast Sequence Transducer Implementation with PyTorch Bindings☆199Sep 20, 2022Updated 3 years ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆76Sep 13, 2023Updated 2 years ago
- Alternative implementation of the coreference scorer for the CoNLL-2011/2012 shared tasks on coreference resolution☆11Apr 29, 2021Updated 4 years ago
- This code is to run the WARP-Q speech quality metric.☆35Oct 15, 2024Updated last year
- ☆10Apr 16, 2020Updated 5 years ago
- A Swift library that makes it easier to create AVAudioEngine-based audio players☆11Oct 14, 2023Updated 2 years ago
- Based on Neural Amp Modeler 0.7.1 with some enhanced features☆12Apr 18, 2023Updated 2 years ago
- Creates CMM script that can directly executed on Kaggle from easy merge script☆14Jan 12, 2026Updated last month
- ☆37Mar 26, 2024Updated last year
- Open source audio annotation tool for humans☆1,130Feb 3, 2026Updated last week
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆156Feb 15, 2020Updated 6 years ago
- A Python toolbox for speech features extraction☆165Feb 8, 2023Updated 3 years ago