ex3ndr/supervoice-librilight-preprocessed

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ex3ndr/supervoice-librilight-preprocessed)

ex3ndr / supervoice-librilight-preprocessed

60k hours of phoneme-aligned audio from audio books

☆19

Alternatives and similar repositories for supervoice-librilight-preprocessed

Users that are interested in supervoice-librilight-preprocessed are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ex3ndr / supervoice-gpt
View on GitHub
GPT-style network for phonemization with durations of text
☆68Mar 21, 2024Updated 2 years ago
pengzhendong / audio-pipeline
View on GitHub
☆23Oct 17, 2024Updated last year
Scarfmonster / HiFiPLN
View on GitHub
Multispeaker Community Vocoder Model for DiffSinger
☆39Aug 11, 2025Updated 11 months ago
LEEYOONHYUNG / GraphTTS
View on GitHub
☆12Jul 6, 2023Updated 3 years ago
ex3ndr / supervoice-gpt-facodec
View on GitHub
GPT for FACodec
☆13Mar 25, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
fss1t / CausalStarGANv2-VC
View on GitHub
☆22Apr 4, 2023Updated 3 years ago
archinetai / aligner-pytorch
View on GitHub
Sequence alignement methods with helpers for PyTorch.
☆24Nov 30, 2022Updated 3 years ago
ftshijt / Interspeech2024_DiscreteSpeechChallenge
View on GitHub
This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.
☆32Jan 26, 2024Updated 2 years ago
cantabile-kwok / vec2wav2.0
View on GitHub
Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995
☆79Dec 3, 2024Updated last year
rishikksh20 / iSTFT-Avocodo-pytorch
View on GitHub
Ultrafast GAN based Vocoder for Text to Speech
☆50Jul 16, 2022Updated 4 years ago
CODEJIN / XiaoiceSing2
View on GitHub
☆19Feb 2, 2023Updated 3 years ago
scutcsq / Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
View on GitHub
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…
☆60Apr 4, 2024Updated 2 years ago
CODEJIN / MLPSinger
View on GitHub
☆24Mar 15, 2022Updated 4 years ago
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AI-Unicamp / TTS-Objective-Metrics
View on GitHub
Objective metrics used in several text-to-speech (TTS) papers.
☆54Jun 17, 2025Updated last year
voidful / Codec-SUPERB
View on GitHub
Audio Codec Speech processing Universal PERformance Benchmark
☆308Jul 4, 2026Updated 3 weeks ago
nii-yamagishilab / ZMM-TTS
View on GitHub
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
☆185Mar 6, 2024Updated 2 years ago
ml-for-speech / speechtoolkit
View on GitHub
[Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…
☆21Jan 10, 2025Updated last year
ZehuaKcrissLi / GTR-Voice
View on GitHub
☆16Nov 11, 2024Updated last year
multimodal-art-projection / Open-Suno
View on GitHub
trying to reproduce suno v3
☆35Jan 29, 2025Updated last year
walker-hyf / FCTalker
View on GitHub
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)
☆26Feb 22, 2024Updated 2 years ago
choiHkk / Transformer-TTS-V2
View on GitHub
☆25Mar 6, 2024Updated 2 years ago
tonnetonne814 / PL-Bert-VITS2
View on GitHub
VITS2 using Phoneme-Level Japanese BERT
☆14Dec 17, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Yazdi9 / TTS-MultiLingual
View on GitHub
Text To Speech Multilingual Support (+20 Language)
☆52May 2, 2023Updated 3 years ago
haiciyang / LaDiffCodec
View on GitHub
ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.
☆56Nov 16, 2025Updated 8 months ago
mt-upc / ZeroSwot
View on GitHub
Pushing the Limits of Zero-shot End-to-End Speech Translation
☆25Dec 12, 2024Updated last year
keonlee9420 / Robust_Fine_Grained_Prosody_Control
View on GitHub
PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis
☆41Feb 20, 2022Updated 4 years ago
yuan1615 / AdaVocoder
View on GitHub
Adaptive Vocoder for Custom Voice
☆61Sep 22, 2022Updated 3 years ago
sarulab-speech / multi-speaker-dgp
View on GitHub
Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
☆24Mar 23, 2021Updated 5 years ago
Yuan-ManX / audio-ai-agent
View on GitHub
Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.
☆16Dec 8, 2023Updated 2 years ago
MiniXC / phones
View on GitHub
A collection of utilities for handling IPA phones.
☆27Sep 24, 2023Updated 2 years ago
xcmyz / ConvTasNet4BasisMelGAN
View on GitHub
This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.
☆21Jul 21, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Liu-Feng-deeplearning / TTS-frontend
View on GitHub
TTS-frontend with Bert and CRF/lstm (For Tacotron)
☆53Jun 2, 2020Updated 6 years ago
yangdongchao / SimpleSpeech
View on GitHub
The open source code for SimpleSpeech series
☆147Oct 8, 2024Updated last year
AudiogenAI / agc
View on GitHub
Audiogen Codec
☆146Jul 9, 2024Updated 2 years ago
zjlww / ardit-web
View on GitHub
☆27Aug 2, 2024Updated last year
0nutation / USLM
View on GitHub
Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)
☆152Sep 14, 2023Updated 2 years ago
Jackson-Kang / VQVC-Pytorch
View on GitHub
An unofficial implementation of Vector Quantization Voice Conversion (VQVC).
☆29Apr 12, 2021Updated 5 years ago
0417keito / PromptTTS2
View on GitHub
[WIP] Unofficial Implementation of Microsoft's PromptTTS2
☆53Oct 31, 2023Updated 2 years ago