talonvoice/wav2train

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/talonvoice/wav2train)

talonvoice / wav2train

automatically align transcribed audio and generate a wav2letter training corpus

☆36

Alternatives and similar repositories for wav2train

Users that are interested in wav2train are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

peladeaucome / ICASSP-2024-BEAFX-using-DDSP
View on GitHub
Github repository for the paper accepted in ICASSP 2024 : Blind estimation of audio effects using an auto-encoder approach and differenti…
☆15Apr 11, 2024Updated 2 years ago
awni / future_speech
View on GitHub
The History of Speech Recognition to the Year 2030
☆13Aug 14, 2021Updated 4 years ago
cahya-wirawan / artificial-commonvoice
View on GitHub
Common Voice Generator using Speech Synthesizer
☆14Jul 28, 2021Updated 5 years ago
wittawatj / jtcc
View on GitHub
Java library to tokenize Thai text into a list of TCCs
☆21May 30, 2017Updated 9 years ago
silversparro / wav2letter.pytorch
View on GitHub
A fully convolution-network for speech-to-text, built on pytorch.
☆126May 20, 2020Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
alumae / streaming-punctuator
View on GitHub
☆17Apr 14, 2023Updated 3 years ago
talonvoice / speech
View on GitHub
speech engine training projects
☆29Apr 19, 2021Updated 5 years ago
mojesty / professor_forcing
View on GitHub
Professor forcing future code
☆10Sep 22, 2018Updated 7 years ago
elnaaz / GCE-Model
View on GitHub
Toward Scalable Neural Dialogue State Tracking Model
☆20Sep 23, 2022Updated 3 years ago
Speech-Lab-IITM / English_ASR_Challenge
View on GitHub
English ASR Challenge organized by Speech Lab, IIT Madras
☆10Feb 3, 2021Updated 5 years ago
vwrj / CPC
View on GitHub
PyTorch implementation of Data-Efficient Image Recognition with Contrastive Predictive Coding
☆13Feb 26, 2020Updated 6 years ago
ekmett / hybrid-vectors
View on GitHub
Hybrid vectors e.g. mixed boxed/unboxed vectors that are suitable for use with vector-algorithms
☆14Aug 29, 2025Updated 10 months ago
midas-research / speechmix
View on GitHub
☆12Oct 2, 2020Updated 5 years ago
bytecell / slotminer
View on GitHub
Tool for slot extraction from text
☆15Oct 23, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
shangeth / wavencoder
View on GitHub
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…
☆92Jun 6, 2021Updated 5 years ago
kitefishlabs / cbpsc
View on GitHub
Corpus-based Processing for SuperCollider
☆17Jul 5, 2013Updated 13 years ago
0xsamsar / metafarmerx
View on GitHub
Automated Yield Farming Framework
☆10Jul 6, 2022Updated 4 years ago
midas-research / audino
View on GitHub
Open source audio annotation tool for humans
☆1,140Feb 3, 2026Updated 5 months ago
ThinkSys / mediapipe-reactnative
View on GitHub
Add motion-based magic to your React Native apps! ThinkSys Mediapipe Plugin offers real-time pose detection for iOS, with easy integratio…
☆35Apr 10, 2026Updated 3 months ago
EgorLakomkin / KTSpeechCrawler
View on GitHub
Automatically constructing corpus for automatic speech recognition from YouTube videos
☆157Feb 15, 2020Updated 6 years ago
gong-io / gecko
View on GitHub
Gecko - A Tool for Effective Annotation of Human Conversations
☆306Dec 1, 2025Updated 7 months ago
csc2541-f17 / csc2541-f17.github.io
View on GitHub
☆12Dec 7, 2017Updated 8 years ago
106368015AlvinYang / Taiwanese-Food-101
View on GitHub
☆11Aug 3, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ambrishrawat / advflow
View on GitHub
Adversarial examples on keras and tensorflow
☆12Apr 5, 2017Updated 9 years ago
francesclluis / direction-ambisonics-source-separation
View on GitHub
Deep learning for directional sound source separation from Ambisonics mixtures.
☆31Oct 1, 2022Updated 3 years ago
fkoep / downcast-rs
View on GitHub
Trait for downcasting trait objects back to their original types.
☆19Sep 25, 2023Updated 2 years ago
iceychris / LibreASR
View on GitHub
An On-Premises, Streaming Speech Recognition System
☆679Nov 28, 2021Updated 4 years ago
Kowsher / Bangla-NLP
View on GitHub
☆14Sep 26, 2021Updated 4 years ago
flashlight / sequence
View on GitHub
Sequence algorithms for use in Flashlight.
☆14Jan 12, 2026Updated 6 months ago
marytts / gradle-marytts-voicebuilding-plugin
View on GitHub
A replacement for the legacy VoiceImportTools in MaryTTS
☆16Oct 27, 2024Updated last year
facebookresearch / libri-light
View on GitHub
dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.
☆528Jul 11, 2023Updated 3 years ago
hpi-epic / dynamic_pricing__ss2018
View on GitHub
Repository for lecture "Data-Driven Demand Learning and Dynamic Pricing Strategies in Competitive Markets"
☆13May 8, 2018Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
auspicious3000 / SpeechSplit-Demo
View on GitHub
Unsupervised Speech Decomposition via Triple Information Bottleneck
☆14Apr 29, 2020Updated 6 years ago
jeremyjordan / model_test
View on GitHub
A proof of concept library for generating and running machine learning model tests
☆13Sep 27, 2020Updated 5 years ago
AccelerateNetworks / DeepSpeech_Frontend
View on GitHub
A webpage and API for using Mozilla DeepSpeech
☆48Feb 24, 2021Updated 5 years ago
hirofumi0810 / neural_sp
View on GitHub
End-to-end ASR/LM implementation with PyTorch
☆594Aug 30, 2021Updated 4 years ago
facebookresearch / WavAugment
View on GitHub
A library for speech data augmentation in time-domain
☆689Aug 30, 2021Updated 4 years ago
ebu / bear
View on GitHub
Binaural EBU ADM Renderer
☆28Jan 24, 2025Updated last year
kanesee / d3-2way-tree
View on GitHub
d3 implementation of a 2-way tree
☆14Dec 19, 2014Updated 11 years ago