Lwasinam / voiceraLinks
☆18Updated 11 months ago
Alternatives and similar repositories for voicera
Users that are interested in voicera are comparing it to the libraries listed below
Sorting:
- Official implementation of the TTS model Lina-Speech☆168Updated 7 months ago
- ☆135Updated 2 weeks ago
- VoiceBox neural network implementation☆110Updated last year
- Collection of Open Source Speech Data☆160Updated 10 months ago
- VALL-E 2 reproduction☆129Updated last year
- ☆276Updated last month
- create dataset from list of youtube links easily☆21Updated 2 years ago
- An unofficial PyTorch implementation of VALL-E☆88Updated last month
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆121Updated 3 weeks ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆120Updated last month
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆103Updated 10 months ago
- finetune llm part for spark-tts model☆107Updated 5 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆159Updated last year
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆52Updated 8 months ago
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On…☆217Updated 3 months ago
- ☆273Updated last year
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"☆115Updated 3 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆202Updated 4 months ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆83Updated this week
- ☆262Updated last year
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆185Updated 11 months ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆85Updated 9 months ago
- Open TTS models, built for streaming on the edge☆42Updated 5 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆78Updated 11 months ago
- ☆22Updated last week
- High quality text-to-speech based on StyleTTS 2.☆60Updated last week
- ☆57Updated last year
- ☆377Updated last year
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch☆495Updated 5 months ago
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆297Updated last month