Automatically align transcribed audio and generate a wav2letter training corpus
☆36Apr 11, 2023Updated 2 years ago
Alternatives and similar repositories for wav2train
Users who are interested in wav2train are comparing it to the libraries listed below
- Facebook AI Research Automatic Speech Recognition Toolkit☆23Mar 13, 2021Updated 4 years ago
- DeepSpeech based forced alignment tool☆239Dec 12, 2020Updated 5 years ago
- A fully convolutional network for speech-to-text, built on PyTorch.☆126May 20, 2020Updated 5 years ago
- Professor forcing future code☆10Sep 22, 2018Updated 7 years ago
- The History of Speech Recognition to the Year 2030☆13Aug 14, 2021Updated 4 years ago
- Common Voice Generator using Speech Synthesizer☆13Jul 28, 2021Updated 4 years ago
- PyTorch implementation of Data-Efficient Image Recognition with Contrastive Predictive Coding☆13Feb 26, 2020Updated 6 years ago
- Parallel Universal Dependencies.☆15Nov 12, 2025Updated 3 months ago
- OHTI Open Head Tracking Initiative☆15Mar 1, 2023Updated 3 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- Java library to tokenize Thai text into a list of TCCs☆19May 30, 2017Updated 8 years ago
- speech engine training projects☆29Apr 19, 2021Updated 4 years ago
- Deep learning for directional sound source separation from Ambisonics mixtures.☆28Oct 1, 2022Updated 3 years ago
- A Neural Audio Codec (NAC) for Universal Audio☆44May 30, 2025Updated 9 months ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Jun 6, 2021Updated 4 years ago
- Binaural EBU ADM Renderer☆26Jan 24, 2025Updated last year
- QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261]☆26Jul 16, 2021Updated 4 years ago
- A TTS Trained on Universal Audio.☆41Jun 6, 2025Updated 9 months ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 3 years ago
- ☆37Sep 21, 2025Updated 5 months ago
- Da - ECHO - RetrievAl - daTasEt☆34Jul 7, 2024Updated last year
- SuperCollider class for GUI-assisted authoring of dynamic ambisonic sound fields.☆27Updated this week
- Blitzing Fast CTC Beam Search Decoder☆186Oct 27, 2025Updated 4 months ago
- PoC port of JUCE to the browser via emscripten☆35Dec 8, 2014Updated 11 years ago
- Add motion-based magic to your React Native apps! ThinkSys Mediapipe Plugin offers real-time pose detection for iOS, with easy integratio…☆32Jan 19, 2026Updated last month
- An auralisation system that takes head-worn microphone array recordings as input and renders the audio for binaural playback; taking in…☆35Oct 10, 2023Updated 2 years ago
- Based on Neural Amp Modeler 0.7.1 with some enhanced features☆12Apr 18, 2023Updated 2 years ago
- Creates a CMM script that can be directly executed on Kaggle from an easy merge script☆14Jan 12, 2026Updated last month
- This code is to run the WARP-Q speech quality metric.☆34Oct 15, 2024Updated last year
- A Swift library that makes it easier to create AVAudioEngine-based audio players☆11Oct 14, 2023Updated 2 years ago
- ☆10Apr 16, 2020Updated 5 years ago
- ☆37Mar 26, 2024Updated last year
- Open source audio annotation tool for humans☆1,131Feb 3, 2026Updated last month
- A bundle of JSFX and scripts for REAPER DAW.☆45Jun 7, 2023Updated 2 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆157Feb 15, 2020Updated 6 years ago
- A Python toolbox for speech features extraction☆165Feb 8, 2023Updated 3 years ago
- Using large language models to maintain AI_CHANGELOG.md☆14Jul 15, 2024Updated last year
- A MaxMSP wrapper for Google Resonance (free, open-source spatial audio)☆11Oct 14, 2020Updated 5 years ago
- Tool for Evaluating Multilingual WS-353 and SimLex-999☆10Dec 15, 2016Updated 9 years ago