Text to Speech using Coqui TTS + RVC
☆112 · Nov 30, 2025 · Updated 2 months ago
Alternatives and similar repositories for TTS-RVC-API
Users that are interested in TTS-RVC-API are comparing it to the libraries listed below
- RVC + UVR = A perfect set of tools for voice cloning, easily and free! ☆227 · Jul 12, 2025 · Updated 7 months ago
- in preparation... ☆542 · Nov 5, 2025 · Updated 3 months ago
- ☆13 · Dec 7, 2022 · Updated 3 years ago
- ☆21 · Mar 7, 2025 · Updated 11 months ago
- GUI for a Vocal Remover that uses Deep Neural Networks. ☆17 · Jan 18, 2024 · Updated 2 years ago
- Using RVC via console or python scripts ☆141 · Oct 18, 2024 · Updated last year
- TTS pipeline that uses RVC to enhance audio quality and cloning ☆146 · Jan 25, 2024 · Updated 2 years ago
- Text-to-Speech Gradio webui using RVC and edge-tts ☆335 · Sep 17, 2023 · Updated 2 years ago
- Russian phonetical transcription ☆11 · Nov 19, 2025 · Updated 3 months ago
- 🎵 muse: Music Separation ☆11 · Feb 14, 2024 · Updated 2 years ago
- ☆25 · Nov 3, 2025 · Updated 3 months ago
- An open-source Khmer Word to Speech Model. Just single word not sentence! ☆18 · Dec 31, 2025 · Updated 2 months ago
- Code for the paper "Free-View Expressive Talking Head Video Editing" (ICASSP 2023) ☆12 · May 26, 2024 · Updated last year
- Package for word stress detection ☆11 · Jan 27, 2023 · Updated 3 years ago
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap" ☆11 · Apr 6, 2020 · Updated 5 years ago
- ☆13 · Apr 14, 2024 · Updated last year
- A simple FastAPI Server to run XTTSv2 ☆573 · Jul 21, 2024 · Updated last year
- Neural model for prediction of stress position in Russian words ☆13 · Jun 22, 2025 · Updated 8 months ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ… ☆27 · Sep 20, 2025 · Updated 5 months ago
- Using OpenVINO to speed up MeloTTS inference ☆15 · Nov 1, 2024 · Updated last year
- Speech recognition module for Python, supporting several engines and APIs, online and offline. ☆13 · Mar 9, 2022 · Updated 3 years ago
- Java Bindings for the C++ library DeepSpeech ☆10 · Jun 4, 2020 · Updated 5 years ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System". ☆30 · Aug 2, 2025 · Updated 6 months ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription. ☆13 · Sep 27, 2024 · Updated last year
- Easily build Electron app using umi ☆16 · Jul 10, 2024 · Updated last year
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS … ☆14 · Aug 8, 2025 · Updated 6 months ago
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install) ☆37 · Dec 31, 2025 · Updated 2 months ago
- ☆13 · Oct 27, 2021 · Updated 4 years ago
- ☆17 · Apr 28, 2021 · Updated 4 years ago
- Launch your speech synthesis within one minute. ☆12 · May 6, 2024 · Updated last year
- Pure C# port of the Pocketsphinx keyword spotter ☆13 · Jan 19, 2020 · Updated 6 years ago
- C++ version of pyannote audio overlapped speech detection pipeline ☆13 · Feb 14, 2024 · Updated 2 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices" ☆21 · Jun 7, 2025 · Updated 8 months ago
- FlowMirror-HydraVox: A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens… ☆38 · Feb 17, 2026 · Updated last week
- Test Framework for few-shot open set KWS ☆41 · Nov 8, 2024 · Updated last year
- (WIP) A retrain of F5-TTS on permissively-licensed data ☆13 · Apr 6, 2025 · Updated 10 months ago
- ☆18 · May 27, 2025 · Updated 9 months ago
- ☆18 · Aug 23, 2024 · Updated last year
- Assistance component base for Dicio assistant components ☆13 · May 27, 2024 · Updated last year