xiaozhah / Aligner

Aligner for text-to-speech

☆14

Alternatives and similar repositories for Aligner:

Users who are interested in Aligner are comparing it to the libraries listed below.

p1an-lin-jung / wv_tts
☆19 Updated 11 months ago
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆27 Updated 6 months ago
ex3ndr / supervoice-vocoder
Production-ready vocoder using BigVSAN
☆11 Updated last year
reppy4620 / convnext_tts
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆15 Updated 4 months ago
Tikai7 / DiTTO-TTS
DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors
☆18 Updated 3 weeks ago
pengzhendong / audio-pipeline
☆20 Updated 4 months ago
Infinity-INF / fast-phasr
Phoneme and duration labeling based on Whisper small
☆11 Updated 7 months ago
huutuongtu / Lightvoc
LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform
☆16 Updated 9 months ago
MiscellaneousStuff / PhoneLM
(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.
☆47 Updated last year
mushanshanshan / ESLTTS
ESLTTS dataset
☆16 Updated 3 weeks ago
ZehuaKcrissLi / GTR-Voice
☆12 Updated 3 months ago
shivammehta25 / BetterFastSpeech2
Just another FastSpeech 2 but cleaner code :)
☆26 Updated 8 months ago
liuhuang31 / g2pw_once
G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…
☆14 Updated last year
CODEJIN / XiaoiceSing2
☆19 Updated 2 years ago
ex3ndr / supervoice-librilight-preprocessed
60k hours of phoneme-aligned audio from audio books
☆18 Updated 7 months ago
shengcanxu / canoSpeech
text to speech
☆10 Updated 11 months ago
ShovalMessica / NAST
Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…
☆44 Updated 8 months ago
jisang93 / VISinger
Unofficial PyTorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆15 Updated last year
reppy4620 / vocoders
My vocoder experiments
☆26 Updated 4 months ago
archinetai / aligner-pytorch
Sequence alignment methods with helpers for PyTorch.
☆24 Updated 2 years ago
MaxMax2016 / Glow-SVC
4 GB GPU and 10 minutes for training
☆12 Updated last year
pengzhendong / streaming-vocos
Streaming Vocos
☆21 Updated last month
ex3ndr / supervoice-gpt-facodec
GPT for FACodec
☆13 Updated 11 months ago
asuni / PitchSqueezer
A robust pitch tracker using synchro-squeezed FFT and frequency-domain autocorrelation
☆34 Updated last year
thuhcsi / PortableTTS
☆12 Updated last year
Ereboas / TacoLM
☆18 Updated 10 months ago
choiHkk / Transformer-TTS-V2
☆26 Updated 11 months ago
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16 Updated 2 years ago
anton-kashkin / hifi_vc
☆25 Updated 2 years ago
iamanigeeit / present
☆12 Updated 6 months ago