SocAIty / Retrieval-based-Voice-Conversion-FastAPI
Adds a web API to RVC to infer via JSON requests
★17 · Updated 4 months ago
Related projects
Alternatives and complementary repositories for Retrieval-based-Voice-Conversion-FastAPI
- Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark · ★40 · Updated 2 months ago
- TTS pipeline that uses RVC to enhance audio quality and cloning · ★139 · Updated 9 months ago
- ★87 · Updated 6 months ago
- Using RVC via console or python scripts · ★77 · Updated 3 weeks ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines · ★84 · Updated 6 months ago
- ★77 · Updated 4 months ago
- Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning · ★135 · Updated 3 months ago
- Efficient approach to speaker diarization using voice characteristics extraction · ★67 · Updated 6 months ago
- Text to Speech using Coqui TTS + RVC · ★89 · Updated 7 months ago
- ★51 · Updated last month
- ★68 · Updated 7 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities. · ★43 · Updated 3 months ago
- ★176 · Updated last month
- TTS with The Massively Multilingual Speech (MMS) project · ★226 · Updated 4 months ago
- Site for sharing Bark voices · ★48 · Updated 4 months ago
- API server for Instant voice cloning by MyShell. · ★69 · Updated last month
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time. · ★44 · Updated last month
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper. · ★66 · Updated 3 weeks ago
- Misc. tools/scripts that I made to use for tortoise · ★17 · Updated 2 months ago
- Text-to-speech API endpoint compatible with OpenAI's TTS API endpoint, using Microsoft Edge TTS to generate speech for free locally · ★115 · Updated last week
- Faster Tortoise inference than Tortoise Fast Fork · ★122 · Updated 6 months ago
- ★59 · Updated 3 weeks ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio. · ★114 · Updated last year
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s… · ★50 · Updated 6 months ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept) · ★64 · Updated last year
- ★34 · Updated 6 months ago
- The code for some apps built with Sieve. · ★70 · Updated 3 weeks ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui. · ★41 · Updated 9 months ago
- Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. Advanced audio processing. · ★204 · Updated 5 months ago
- On-device streaming text-to-speech engine powered by deep learning · ★54 · Updated last week