hparcells / rtvc
💬 "Realtime" voice transcription and cloning using ElevenLabs's API.
⭐50 Updated last year
Related projects
Alternatives and complementary repositories for rtvc
- Speech to text to speech using ElevenLabs ⭐28 Updated last year
- This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses P… ⭐55 Updated 11 months ago
- Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs APIs ⭐14 Updated last year
- A multi-voice TTS system trained with an emphasis on quality ⭐26 Updated last year
- Advanced RVC Inference for quicker and effortless model downloads ⭐32 Updated 8 months ago
- Imagine translating your speech or anybody's speech to any language you want within minutes. Check this out... ⭐36 Updated 3 months ago
- A minimalistic automatic speech recognition Streamlit-based webapp powered by OpenAI's Whisper "State of the Art" models ⭐65 Updated 2 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning … ⭐24 Updated last year
- Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speakin… ⭐27 Updated last year
- This is a simple Graphical User Interface (GUI) application for working with the OpenAI API, allowing you to use OpenAI's Chat Completion… ⭐29 Updated this week
- Genie in the Box: Distill Whisper STT => Mistral-7B => Phind/Phind-CodeLlama-34B-v2 => GPT 3.5 => Coqui's TTS/OpenAI TTS ⭐15 Updated 7 months ago
- ⭐68 Updated 8 months ago
- AudioLDM text to audio colab ⭐19 Updated last year
- Auto-Video maker handling many AIs ⭐12 Updated 8 months ago
- Text prompt steered synthetic audio generators ⭐45 Updated 11 months ago
- Smart, context-aware GPT-4/Custom LLM bot for Discord ⭐40 Updated last year
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and uses CUDA for … ⭐13 Updated last month
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos ⭐45 Updated 4 months ago
- The official front-end UI. ⭐38 Updated last year
- Your own personal assistant thanks to ChatGPT, Whisper, and ElevenLabs TTS ⭐48 Updated last year
- AI agent capable of automating various tasks using the OpenAI function-calling feature ⭐36 Updated 11 months ago
- llmon-py is a multimodal webui for Llama 3-8B. ⭐15 Updated 4 months ago
- RealVoiceGPT is a web application that lets you have voice conversations with ChatGPT. The project uses ElevenLabs AI text to speech to g… ⭐25 Updated last year
- Misc. tools/scripts that I made to use for tortoise ⭐18 Updated 3 months ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept) ⭐64 Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ⭐24 Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ⭐85 Updated 6 months ago
- A library for defining AI personalities for AI-based models. We define a file format, assets and personalized scripts. ⭐54 Updated last year
- Shared Voice Interface ⭐40 Updated last year
- ⭐35 Updated last year