CorentinJ/librispeech-alignments

Word alignments generated by the Montreal Forced Aligner for the LibriSpeech dataset

☆182

Alternatives and similar repositories for librispeech-alignments

Users who are interested in librispeech-alignments are comparing it to the repositories listed below.

r9y9 / kiritan_singing
Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.
☆28 · Dec 31, 2023 · Updated 2 years ago
espnet / icassp2020-tts
ESPnet-TTS Audio Sample HP
☆21 · Oct 25, 2019 · Updated 6 years ago
Yangyangii / TPGST-Tacotron
Google's TPGST reimplementation.
☆34 · Dec 11, 2019 · Updated 6 years ago
0nutation / DUB
Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings)
☆28 · Jun 28, 2023 · Updated 3 years ago
MontrealCorpusTools / Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
☆1,847 · Jul 11, 2026 · Updated last week
kan-bayashi / LibriTTSLabel
Alignment files of LibriTTS.
☆70 · Mar 16, 2020 · Updated 6 years ago
nii-yamagishilab / TSNetVocoder
☆42 · Oct 30, 2018 · Updated 7 years ago
ex3ndr / supervoice-librilight-preprocessed
60k hours of phoneme-aligned audio from audiobooks
☆19 · Jul 27, 2024 · Updated last year
xcmyz / Transformer-TTS
TTS model based on Transformer.
☆57 · Aug 2, 2019 · Updated 6 years ago
kokeshing / WaveNet-Estimator
WaveNet implementation using tf.estimator
☆21 · Jul 6, 2023 · Updated 3 years ago
nii-yamagishilab / tacotron2
An implementation of Tacotron and Tacotron2
☆80 · Aug 4, 2021 · Updated 4 years ago
Kyubyong / g2p
g2p: English Grapheme To Phoneme Conversion
☆927 · Jan 5, 2023 · Updated 3 years ago
MarkWuNLP / SemanticMask
This repo contains the code for "Semantic Mask for Transformer based End-to-End Speech Recognition"
☆39 · Jun 9, 2020 · Updated 6 years ago
geneing / WaveRNN-Pytorch
Fatcord's Alternative WaveRNN (Faster training)
☆132 · Nov 29, 2020 · Updated 5 years ago
jxzhanggg / nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC
☆248 · Mar 24, 2023 · Updated 3 years ago
bootphon / phonemizer
Simple text to phones converter for multiple languages
☆1,557 · Sep 26, 2024 · Updated last year
kan-bayashi / ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
☆1,646 · Apr 22, 2024 · Updated 2 years ago
r9y9 / icassp2020-espnet-tts-merlin-baseline
ICASSP 2020 ESPnet-TTS: Merlin baseline system
☆37 · Oct 28, 2019 · Updated 6 years ago
kahne / SpeechTransProgress
Tracking the progress in end-to-end speech translation
☆260 · Oct 25, 2023 · Updated 2 years ago
NaoyukiKanda / LibriSpeechMix
☆38 · Mar 30, 2021 · Updated 5 years ago
yanggeng1995 / FB-MelGAN
A PyTorch implementation of FB-MelGAN
☆90 · May 26, 2020 · Updated 6 years ago
shashankshirol / GeneratingNoisySpeechData
A repository containing code for generating noisy speech data from clean data using deep learning methods
☆16 · Jul 12, 2021 · Updated 5 years ago
facebookresearch / WavAugment
A library for speech data augmentation in the time domain
☆689 · Aug 30, 2021 · Updated 4 years ago
ZackHodari / average_prosody
Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…
☆24 · Dec 8, 2019 · Updated 6 years ago
ljuvela / multiscale-GAN
Code for ICASSP 2019 paper
☆18 · Oct 29, 2018 · Updated 7 years ago
iamyuanchung / Autoregressive-Predictive-Coding
Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning
☆191 · Jan 29, 2020 · Updated 6 years ago
ttaoREtw / semi-tts
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
☆39 · Jul 16, 2020 · Updated 6 years ago
wenet-e2e / speech-synthesis-paper
List of speech synthesis papers.
☆1,074 · Jul 24, 2023 · Updated 2 years ago
cyhuang-tw / robust-vc
☆11 · May 7, 2022 · Updated 4 years ago
s3prl / s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,557 · Mar 12, 2026 · Updated 4 months ago
Helsinki-NLP / prosody
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
☆249 · Oct 30, 2019 · Updated 6 years ago
bshall / UniversalVocoding
A PyTorch implementation of "Robust Universal Neural Vocoding"
☆238 · Nov 14, 2020 · Updated 5 years ago
NVIDIA / mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…
☆870 · Jul 22, 2023 · Updated 2 years ago
ZackHodari / tts_data_tools
Data processing tools for preparing speech and labels for training TTS voices
☆29 · Aug 13, 2020 · Updated 5 years ago
bshall / VectorQuantizedCPC
Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion
☆142 · Sep 1, 2020 · Updated 5 years ago
X-LANCE / public_talks
Materials from public talks given by SJTU X-LANCE members
☆14 · Dec 3, 2022 · Updated 3 years ago
JeremyCCHsu / Python-Wrapper-for-World-Vocoder
A Python wrapper for the high-quality vocoder "World"
☆789 · Jan 21, 2025 · Updated last year
X-LANCE / StoryTTS
[ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations
☆141 · Apr 27, 2024 · Updated 2 years ago
candlewill / CNTN
ChiNese Text Normalization (CNTN) tool for text-to-speech systems
☆37 · Apr 12, 2018 · Updated 8 years ago