hparcells / rtvc
"Realtime" voice transcription and cloning using ElevenLabs's API.
★53, updated 2 years ago
Alternatives and similar repositories for rtvc
Users who are interested in rtvc are comparing it to the libraries listed below.
- This chatbot lets you use your microphone to communicate with GPT-4. It uses OpenAI text-to-speech to respond with a voice. It uses P… (★56, updated last year)
- Speech to text to speech using ElevenLabs (★28, updated last year)
- Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs APIs (★14, updated last year)
- AudioLDM text to audio colab (★19, updated last year)
- Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp (★19, updated 2 weeks ago)
- (★71, updated last year)
- A simplistic UI connecting GPT-3 and Stable Diffusion (★15, updated 2 years ago)
- Gradio UI for TortoiseTTS voice generation (★34, updated last year)
- This is a simple Graphical User Interface (GUI) application for working with the OpenAI API, allowing you to use OpenAI's Chat Completion… (★29, updated 3 months ago)
- A GPT client with long-term memory (★38, updated last year)
- Your personal assistant who will help you with your loneliness (★18, updated 2 years ago)
- Site for sharing MusicGen + AudioGen Prompts and Creations (★42, updated last month)
- A multi-voice TTS system trained with an emphasis on quality (★26, updated 2 years ago)
- Speech-to-text, text-to-speech with ElevenLabs (★26, updated last year)
- Audio datasets, easier. (★84, updated last year)
- Text prompt steered synthetic audio generators (★46, updated last month)
- Auto-Video maker handling many AIs (★10, updated last year)
- Transcribe with ease :D (★14, updated last year)
- Misc. tools/scripts that I made to use for tortoise (★21, updated 8 months ago)
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script uses custom datasets and uses CUDA for … (★13, updated 7 months ago)
- Genie in the Box: Distill Whisper STT => Mistral-7B => Phind/Phind-CodeLlama-34B-v2 => GPT 3.5 => Coqui's TTS/OpenAI TTS (★16, updated last year)
- (★99, updated 9 months ago)
- Advanced RVC Inference for quicker and effortless model downloads (★51, updated last month)
- CopperAI offers a hands-free, voice-to-voice interaction system with a Large Language Model (LLM) (★29, updated last year)
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept) (★67, updated last year)
- (★17, updated 2 years ago)
- Unofficial Bark API (★9, updated last year)
- (★83, updated 10 months ago)
- (★27, updated last year)
- A simple LangChain agent setup that makes it easy to test out new agent tools. (★15, updated 2 years ago)