quantumlump / eBook_to_Audiobook_with_F5-TTS
Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)
☆36 · Dec 31, 2025 · Updated last month
Alternatives and similar repositories for eBook_to_Audiobook_with_F5-TTS
Users who are interested in eBook_to_Audiobook_with_F5-TTS are comparing it to the libraries listed below:
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback… ☆10 · Sep 17, 2025 · Updated 4 months ago
- An AR+AR TTS attempt. ☆18 · Jan 13, 2025 · Updated last year
- A successor to booknlp, aiming to fix bugs and improve model performance ☆17 · Jul 16, 2024 · Updated last year
- ☆13 · Nov 22, 2022 · Updated 3 years ago
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis ☆27 · Mar 21, 2025 · Updated 10 months ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model. ☆52 · May 22, 2025 · Updated 8 months ago
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl… ☆14 · Feb 7, 2025 · Updated last year
- The implementation of the paper *SinGS: Animatable Single-Image Human Gaussian Splats with Kinematic Priors* [CVPR 2025] ☆37 · Nov 11, 2025 · Updated 3 months ago
- An official implementation of Style-Talker for Spoken Dialogue Generation ☆23 · Jan 12, 2025 · Updated last year
- Personal GPEN scripts within the GPEN-Windows stand-alone package. ☆20 · Jun 5, 2022 · Updated 3 years ago
- Running the F5-TTS by ONNX Runtime ☆191 · Jan 7, 2026 · Updated last month
- Faster Whisper ASR transcription with CTranslate2 ☆24 · Oct 25, 2024 · Updated last year
- singing voice conversion without f0 ☆23 · May 10, 2023 · Updated 2 years ago
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis ☆36 · Dec 24, 2025 · Updated last month
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆53 · Dec 17, 2024 · Updated last year
- A collection of all our phonemizers for dataset construction and inference ☆27 · Feb 21, 2025 · Updated 11 months ago
- A pitch detection model trained to be robust against noise and reverberation environments. ☆27 · Jan 21, 2025 · Updated last year
- Finally, some decent sample sentences ☆23 · Dec 3, 2023 · Updated 2 years ago
- ☆30 · Oct 29, 2024 · Updated last year
- Non Parallel Voice Conversion based on VITS ☆24 · Mar 31, 2023 · Updated 2 years ago
- ☆28 · Nov 15, 2023 · Updated 2 years ago
- A VITS Dataset Generation Tool for Converting Video to Short Audio Based on Damo Academy Video Cutting T… ☆55 · Jan 17, 2024 · Updated 2 years ago
- Musculoskeletal Analysis extension for 3D Slicer. Currently has cortical, cancellous, and bone density analysis. ☆12 · May 2, 2024 · Updated last year
- Future version of the AnyBody Managed Model Repository with a full thoracic spine model. ☆18 · Feb 2, 2026 · Updated 2 weeks ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) ☆30 · Jun 9, 2025 · Updated 8 months ago
- ☆40 · Jul 15, 2025 · Updated 7 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision ☆29 · Dec 16, 2023 · Updated 2 years ago
- ☆29 · Jun 30, 2025 · Updated 7 months ago
- StyleTTS 2 Optimized Training Fork ☆33 · Feb 2, 2025 · Updated last year
- Next-generation, fully open-source refacer. Images. GIFs. TIFFs. Full-length videos. Bulk refacing ☆41 · May 16, 2025 · Updated 9 months ago
- Your one-stop solution for voice dataset creation ☆129 · Dec 10, 2023 · Updated 2 years ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024) ☆78 · Nov 1, 2024 · Updated last year
- ☆10 · Oct 23, 2024 · Updated last year
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ☆28 · Mar 14, 2023 · Updated 2 years ago
- ☆33 · Jun 29, 2023 · Updated 2 years ago
- Forsen ☆32 · Apr 29, 2025 · Updated 9 months ago
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody" ☆34 · Nov 23, 2023 · Updated 2 years ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ☆37 · May 17, 2025 · Updated 8 months ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion ☆91 · Jul 23, 2025 · Updated 6 months ago