quantumlump / eBook_to_Audiobook_with_F5-TTSLinks
Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)
☆25Updated 2 months ago
Alternatives and similar repositories for eBook_to_Audiobook_with_F5-TTS
Users that are interested in eBook_to_Audiobook_with_F5-TTS are comparing it to the libraries listed below
Sorting:
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆26Updated last week
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆38Updated last week
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆96Updated 2 weeks ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆12Updated 7 months ago
- ☆21Updated last month
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 7 months ago
- ☆67Updated 2 months ago
- A random walk voice style cloning application for Kokoro text to speech☆85Updated last week
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆61Updated 6 months ago
- ☆50Updated 6 months ago
- Jupyter notebooks for Inpainting | Outpainting with Flux.1 Fill dev. Able to run on Google Colab Free Tier☆31Updated 5 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆36Updated last week
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0☆53Updated 11 months ago
- Performs the entire AI cover generation process with UI☆18Updated 3 weeks ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆40Updated last week
- A SwarmUI extension that adds parameters for ReActor to the the generate tab☆21Updated last week
- Instaswap Desktop App☆19Updated 5 months ago
- Hanasu is a human-like TTS model based on the multilingual Himitsu V1 transformer-based encoder and VITS architecture☆28Updated this week
- Collection of the best Applio plugins.☆29Updated 8 months ago
- This is a simple ComfyUI custom TTS node based on Parler_tts.☆44Updated 5 months ago
- ☆27Updated last year
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆23Updated 2 months ago
- ☆36Updated 3 months ago
- OminiControl for the GPU Poor☆27Updated 4 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆96Updated 2 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆36Updated 2 weeks ago
- This is a pre-built wheel of Triton 3.3.0 for Windows with Nvidia only + Proton☆23Updated 2 weeks ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆18Updated last week
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…☆13Updated 3 months ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆12Updated 2 months ago