anyvoiceai / Barkify
Barkify: an unofficial training implementation of Bark TTS by suno-ai
☆128 Updated last year
Alternatives and similar repositories for Barkify:
Users who are interested in Barkify are comparing it to the libraries listed below.
- Train the next generation of TTS systems. ☆163 Updated 6 months ago
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech ☆234 Updated last year
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E ☆136 Updated 4 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io ☆68 Updated last year
- Application of MB-iSTFT-VITS components to vits2_pytorch ☆123 Updated 3 months ago
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3 ☆192 Updated 10 months ago
- Easy-to-Use Speech MOS predictors ☆270 Updated last year
- Official Implementation of StyleTTS-VC ☆177 Updated last month
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei ☆141 Updated last year
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code ☆199 Updated 2 years ago
- Monotonic Alignment Search ☆89 Updated 2 years ago
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations ☆147 Updated last year
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment) ☆120 Updated 2 years ago
- Singing Voice Synthesis based on VITS, different from VISinger ☆188 Updated last year
- Implementation of SoundStorm built upon SpeechTokenizer. ☆108 Updated last year
- ☆253 Updated last year
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis ☆147 Updated 2 years ago
- An unofficial PyTorch implementation of VALL-E ☆88 Updated this week
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training ☆122 Updated 2 years ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS ☆126 Updated 2 years ago
- ☆69 Updated last year
- It's a repository for implementations of neural speech editing algorithms. ☆194 Updated last year
- HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform ☆154 Updated last month
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions ☆241 Updated last month
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models ☆148 Updated last year
- Unofficial implementation of NVIDIA P-Flow TTS paper ☆220 Updated 2 months ago
- CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone ☆136 Updated 11 months ago
- ☆71 Updated last year
- ☆140 Updated last year
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm" ☆131 Updated last year