hparcells / rtvc
π¬ "Realtime" voice transcription and cloning using ElevenLabs's API.
β53Updated last year
Alternatives and similar repositories for rtvc:
Users that are interested in rtvc are comparing it to the libraries listed below
- Speech to text to speech using Elevenlabsβ28Updated last year
- Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs API'sβ14Updated last year
- This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses Pβ¦β55Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)β64Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning β¦β26Updated last year
- AudioLDM text to audio colabβ19Updated last year
- Advanced RVC Inference for quicker and effortless model downloadsβ37Updated this week
- Text prompt steered synthetic audio generatorsβ45Updated last year
- This is a simple Graphical User Interface (GUI) application for working with the OpenAI API, allowing you to use OpenAI's Chat Completionβ¦β29Updated last month
- MimicMania is a web application that allows you to generate speech and clone voices using text-to-speech technology. With MimicMania, youβ¦β60Updated last year
- A fast MP3 decoder for python, using minimp3β28Updated 2 years ago
- Site for sharing Bark voicesβ49Updated 6 months ago
- β69Updated 10 months ago
- A library for defining AI personalities for AI based models.We define a file format, assets and personalized scripts.β54Updated last year
- The official front-end UI.β38Updated last year
- Full python wrapper for the elevenlabs API.β155Updated 3 months ago
- Misc. tools/scripts that I made to use for tortoiseβ21Updated 5 months ago
- an improved version of Real-time-voice-cloningβ46Updated 10 months ago
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocosβ45Updated 6 months ago
- Audio datasets, easier.β83Updated last year
- Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speakinβ¦β33Updated last year
- Powered by OpenAI Whisper & Gradioβ30Updated 2 years ago
- A multi-voice TTS system trained with an emphasis on qualityβ26Updated 2 years ago
- A character chat with integrated medium and long-term memoryβ14Updated last week
- RVC Inference with multiple model and huggingface supportβ102Updated 10 months ago
- Heteronym to Phoneme Parserβ18Updated last year
- Site for sharing MusicGen + AudioGen Prompts and Creationsβ41Updated 6 months ago
- A GPT client with long term memoryβ38Updated last year
- A reverse engineered Python API wrapper for OpenPlayground (nat.dev)β76Updated last year
- CopperAI offers a hands-free, voice-to-voice interaction system with a Large Language Model (LLM)β32Updated last year