SocAIty / Retrieval-based-Voice-Conversion-FastAPI
Adds a web API to RVC for running inference via JSON requests
★17 · Updated 4 months ago
Related projects
Alternatives and complementary repositories for Retrieval-based-Voice-Conversion-FastAPI
- Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark · ★41 · Updated 2 months ago
- TTS pipeline that uses RVC to enhance audio quality and cloning · ★139 · Updated 9 months ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept) · ★64 · Updated last year
- ★87 · Updated 6 months ago
- ★59 · Updated last month
- Site for sharing Bark voices · ★48 · Updated 4 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines · ★84 · Updated 6 months ago
- ★51 · Updated 2 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di… · ★63 · Updated this week
- ★68 · Updated 8 months ago
- Using RVC via console or python scripts · ★78 · Updated last month
- Efficient approach to speaker diarization using voice characteristics extraction · ★68 · Updated 6 months ago
- ★37 · Updated this week
- The code for some apps built with Sieve. · ★71 · Updated last month
- Misc. tools/scripts that I made to use for tortoise · ★18 · Updated 3 months ago
- API server for Instant voice cloning by MyShell. · ★69 · Updated last month
- ★77 · Updated 4 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" · ★71 · Updated last month
- a transcription application that listens to audio input from the microphone using OpenAI's Whisper, transcribes it into text, and simulat… · ★15 · Updated 9 months ago
- Synchronize Whisper's timestamps over an existing accurate transcription · ★132 · Updated 5 months ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time. · ★46 · Updated last month
- Slightly improved official version for finetune xtts · ★236 · Updated last month
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. · ★53 · Updated 10 months ago
- ★93 · Updated 3 months ago
- Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning · ★138 · Updated 4 months ago
- fast state-of-the-art speech models and a runtime that runs anywhere · ★55 · Updated last month
- RVC Inference with multiple model and huggingface support · ★102 · Updated 8 months ago
- Text to Speech using Coqui TTS + RVC · ★90 · Updated 8 months ago
- Windows-compatible Fast API implementation of VoiceCraft, the Zero-Shot Speech Editing and Text-to-Speech in the Wild · ★18 · Updated 6 months ago
- A UI for the Piper TTS · ★67 · Updated 2 months ago