lpscr / F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
β50Updated 4 months ago
Alternatives and similar repositories for F5-TTS:
Users that are interested in F5-TTS are comparing it to the libraries listed below
- β58Updated 6 months ago
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ154Updated 8 months ago
- β95Updated 10 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusionβ173Updated 5 months ago
- Awesome music generation modelββMGΒ²β144Updated last month
- Running the F5-TTS by ONNX Runtimeβ123Updated this week
- β82Updated 8 months ago
- Misc. tools/scripts that I made to use for tortoiseβ21Updated 7 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,β¦β67Updated 5 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on β¦β87Updated 2 weeks ago
- β207Updated 5 months ago
- AI powered speech denoising and enhancement. Adapted for windows and optimizedβ82Updated 8 months ago
- Generative models for conditional audio generationβ143Updated last month
- VALL-E 2 reproductionβ117Updated 8 months ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.β41Updated last year
- Audio datasets, easier.β82Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ33Updated 4 months ago
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)β21Updated 3 months ago
- Text-to-Music Generation with Rectified Flow Transformerβ60Updated 6 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes sβ¦β52Updated 10 months ago
- G2Pβ171Updated this week
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.β32Updated 2 weeks ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)β18Updated 3 months ago
- β39Updated 10 months ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.β68Updated 8 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.ioβ68Updated last year