talonvoice / wav2train
automatically align transcribed audio and generate a wav2letter training corpus
☆36 · Apr 11, 2023 · Updated 2 years ago
Alternatives and similar repositories for wav2train
Users who are interested in wav2train are comparing it to the libraries listed below.
- Facebook AI Research Automatic Speech Recognition Toolkit ☆23 · Mar 13, 2021 · Updated 4 years ago
- DeepSpeech based forced alignment tool ☆239 · Dec 12, 2020 · Updated 5 years ago
- A fully convolution-network for speech-to-text, built on pytorch. ☆126 · May 20, 2020 · Updated 5 years ago
- Professor forcing future code ☆10 · Sep 22, 2018 · Updated 7 years ago
- The History of Speech Recognition to the Year 2030 ☆13 · Aug 14, 2021 · Updated 4 years ago
- A set of impulse response files used for convolution-based encoders and decoders ☆15 · Feb 26, 2022 · Updated 3 years ago
- OHTI Open Head Tracking Initiative ☆15 · Mar 1, 2023 · Updated 2 years ago
- Toward Scalable Neural Dialogue State Tracking Model ☆20 · Sep 23, 2022 · Updated 3 years ago
- speech engine training projects ☆29 · Apr 19, 2021 · Updated 4 years ago
- Deep learning for directional sound source separation from Ambisonics mixtures. ☆28 · Oct 1, 2022 · Updated 3 years ago
- A Neural Audio Codec (NAC) for Universal Audio ☆44 · May 30, 2025 · Updated 8 months ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi… ☆92 · Jun 6, 2021 · Updated 4 years ago
- A webpage and API for using Mozilla DeepSpeech ☆48 · Feb 24, 2021 · Updated 4 years ago
- ☆57 · Apr 18, 2023 · Updated 2 years ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261] ☆26 · Jul 16, 2021 · Updated 4 years ago
- Binaural EBU ADM Renderer ☆26 · Jan 24, 2025 · Updated last year
- CUDA-Warp RNN-Transducer ☆216 · Feb 22, 2023 · Updated 2 years ago
- ☆37 · Sep 21, 2025 · Updated 4 months ago
- Da - ECHO - RetrievAl - daTasEt ☆34 · Jul 7, 2024 · Updated last year
- Blitzing Fast CTC Beam Search Decoder ☆185 · Oct 27, 2025 · Updated 3 months ago
- PoC port of JUCE to the browser via emscripten ☆35 · Dec 8, 2014 · Updated 11 years ago
- An auralisation system that takes a head-worn microphone array recordings as input and renders the audio for binaural playback; taking in… ☆35 · Oct 10, 2023 · Updated 2 years ago
- A Fast Sequence Transducer Implementation with PyTorch Bindings ☆199 · Sep 20, 2022 · Updated 3 years ago
- This code is to run the WARP-Q speech quality metric. ☆35 · Oct 15, 2024 · Updated last year
- Creates CMM script that can be directly executed on Kaggle from easy merge script ☆14 · Jan 12, 2026 · Updated last month
- A Swift library that makes it easier to create AVAudioEngine-based audio players ☆11 · Oct 14, 2023 · Updated 2 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition ☆30 · Feb 1, 2020 · Updated 6 years ago
- ☆10 · Apr 16, 2020 · Updated 5 years ago
- ☆37 · Mar 26, 2024 · Updated last year
- Open source audio annotation tool for humans ☆1,130 · Feb 3, 2026 · Updated last week
- A bundle of JSFX and scripts for REAPER DAW. ☆45 · Jun 7, 2023 · Updated 2 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos ☆156 · Feb 15, 2020 · Updated 6 years ago
- A Python toolbox for speech features extraction ☆165 · Feb 8, 2023 · Updated 3 years ago
- A MaxMSP wrapper for Google Resonance (free, open-source spatial audio) ☆11 · Oct 14, 2020 · Updated 5 years ago
- This repository defines a python class that can be used to load data for the tf.keras.model.fit_generator function by using a torch.utils… ☆11 · Oct 26, 2024 · Updated last year
- ☆34 · Updated this week
- Phase Vocoder and Wavelet Transform Implementation for Pitch Shifting a sound signal ☆11 · Jul 27, 2020 · Updated 5 years ago
- Using large language models to maintain AI_CHANGELOG.md ☆14 · Jul 15, 2024 · Updated last year
- Determines whether the current OS X computer's firmware is up-to-date. ☆10 · Feb 24, 2015 · Updated 10 years ago