ddlBoJack / Awesome-Speech-GenerationLinks

Paper, Code and Statistics for Speech Generatation.

☆10

Alternatives and similar repositories for Awesome-Speech-Generation

Users that are interested in Awesome-Speech-Generation are comparing it to the libraries listed below

Sorting:

audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Updated last year
amphionspace / tts-evaluation
An evaluation set for large-scale trained TTS models (Coming in Sep 2024)
☆12Updated 10 months ago
cyhuang-tw / robust-vc
☆11Updated 3 years ago
ryota-komatsu / speech_resynth
Speech Resynthesis and Language Modeling
☆20Updated last month
ttslr / MonTTS
☆13Updated 3 years ago
lexkoro / cfm-vc
☆11Updated 4 months ago
yoongi43 / VRVQ
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Updated 3 months ago
karchkha / MelSpec_GPT_VQVAE
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18Updated last year
lexkoro / StyleTTS
☆11Updated 2 years ago
PanagiotisP / svs-multiband
Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022
☆15Updated 3 years ago
kjw11 / Speaker-Aware-CTC
Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.
☆20Updated last month
mutiann / neural-lexicon-reader
Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge
☆21Updated 2 years ago
interactiveaudiolab / emphases
Crowdsourced and Automatic Speech Prominence Estimation
☆21Updated last year
csalt-research / accented-codebooks-asr
☆19Updated 10 months ago
p1an-lin-jung / wv_tts
☆19Updated last year
BUTSpeechFIT / TS_SUPERB
☆15Updated 3 months ago
pengzhendong / streaming-vocos
Streaming Vocos
☆28Updated last month
jisang93 / VISinger
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆15Updated 2 years ago
huutuongtu / Lightvoc
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18Updated last year
exercise-book-yq / FreeCodec
FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS
☆21Updated 10 months ago
thuhcsi / PortableTTS
☆12Updated 2 years ago
slp-rl / SpokenStoryCloze
A spoken version of the textual story cloze benchmark
☆17Updated last year
zengchang233 / CrossSinger
The source code for the paper CrossSinger (asru2023)
☆18Updated last year
mushanshanshan / ESLTTS
ESLTTS dataset
☆16Updated 5 months ago
Mddct / usm-tokenizer
semantic tokenizer for speech and music
☆21Updated last week
ajaybati / miipher2.0
Reimplementation of Miipher
☆22Updated last year
b-sigpro / sed-hsmm
Onset-and-Offset-Aware Sound Event Detection
☆17Updated 5 months ago
JusperLee / Look2hear
A toolkit for researchers in the multimodal sound separation.
☆16Updated last year
meaningTeam / tidy-tunes
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆21Updated last week
lifeiteng / VoiceBox
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
☆27Updated last year