quantumlump / eBook_to_Audiobook_with_F5-TTSLinks
Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)
☆28Updated 4 months ago
Alternatives and similar repositories for eBook_to_Audiobook_with_F5-TTS
Users that are interested in eBook_to_Audiobook_with_F5-TTS are comparing it to the libraries listed below
Sorting:
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆29Updated 4 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 3 weeks ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆120Updated 6 months ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆13Updated last year
- SoTA open-source TTS☆99Updated 2 weeks ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆66Updated 10 months ago
- Examples of using the llasa-tts models locally☆180Updated 5 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆103Updated 2 weeks ago
- ☆46Updated this week
- ☆66Updated 6 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆37Updated 4 months ago
- RVC realtime voice changer - standalone/lightweight☆77Updated last month
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆157Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated 2 years ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆57Updated 4 months ago
- A UI for the Piper TTS☆101Updated last year
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆42Updated 3 weeks ago
- A random walk voice style cloning application for Kokoro text to speech☆141Updated 3 months ago
- Create Unmute voice embeddings☆18Updated last month
- ☆17Updated last year
- Performs the entire AI cover generation process with UI☆24Updated 2 months ago
- Unofficial WIP LoRa Finetuning repository for VibeVoice☆206Updated 2 weeks ago
- ☆99Updated last year
- SoTA open-source TTS☆96Updated 4 months ago
- BeltOut: An open source pitch-perfect voice-to-voice timbre transfer model based on ChatterboxVC☆76Updated 2 months ago
- Jupyter notebooks for Inpainting | Outpainting with Flux.1 Fill dev. Able to run on Google Colab Free Tier☆32Updated 9 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆103Updated 6 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆10Updated last year
- ☆282Updated 2 months ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆69Updated last year