lpscr / F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆26Updated this week
Related projects ⓘ
Alternatives and complementary repositories for F5-TTS
- Text-to-Music Generation with Rectified Flow Transformer☆45Updated 2 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆71Updated this week
- ☆87Updated 6 months ago
- ☆26Updated 10 months ago
- Awesome music generation model——MG²☆104Updated this week
- Uses deepgram/whisper/custom models to create an LJSpeech dataset for voice model fine tuning☆12Updated this week
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆135Updated 3 months ago
- Using RVC via console or python scripts☆77Updated 3 weeks ago
- AI powered speech denoising and enhancement. Adapted for windows and optimized☆63Updated 4 months ago
- ☆25Updated 7 months ago
- ☆65Updated 3 weeks ago
- ☆77Updated 4 months ago
- ☆51Updated last month
- Generative models for conditional audio generation☆117Updated 2 months ago
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0☆44Updated 5 months ago
- 🎵 LyricWave - Your AI Music Composer 🎶 Compose Unique MP4 Songs Effortlessly! LyricWave uses AI to create personalized music by harmoni…☆26Updated 7 months ago
- ☆34Updated 6 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆39Updated 7 months ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆64Updated 4 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆158Updated last month
- A collection of compiled wheels for deepspeed built for python 3.10 and 3.11 with support for cuda 11.8 and 12.1 for Windows☆45Updated last month
- Misc. tools/scripts that I made to use for tortoise☆17Updated 2 months ago
- Advanced RVC Inference for quicker and effortless model downloads☆32Updated 8 months ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆13Updated last month
- VALL-E 2 reproduction☆83Updated 3 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆38Updated last month
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆57Updated 8 months ago
- ☆11Updated last year
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆81Updated last month
- Decked-out gradio client for audio diffusion, mainly stable-audio-tools.☆36Updated 5 months ago