π¬ "Realtime" voice transcription and cloning using ElevenLabs's API.
β56Mar 1, 2023Updated 3 years ago
Alternatives and similar repositories for rtvc
Users that are interested in rtvc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Speech to text to speech using Elevenlabsβ27Jul 2, 2023Updated 2 years ago
- Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs API'sβ14Jun 24, 2023Updated 2 years ago
- A user-friendly interface for ElevenLabs' API with added audio transcription capability.β13Jun 20, 2023Updated 2 years ago
- This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses Pβ¦β55Dec 6, 2023Updated 2 years ago
- Code for "Distribution-based Emotion Recognition in Conversation"β19Feb 6, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Project for HIDING SPEAKERβS SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINEβ15Nov 30, 2022Updated 3 years ago
- Streamlit app to visualize and edit TTS datasetsβ15Dec 15, 2021Updated 4 years ago
- A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.β11May 20, 2026Updated last week
- Avocodo: Generative Adversarial Network for Artifact-free Vocoderβ122Jul 14, 2022Updated 3 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representationβ12Jan 27, 2023Updated 3 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesisβ36Jul 31, 2024Updated last year
- Autonomous and goal-seeking coding agents. π«β21Dec 11, 2024Updated last year
- β11May 7, 2022Updated 4 years ago
- A voice-powered AI built with Whisper, ChatGPT, and ElevenLabsβ151Apr 2, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speechβ11May 14, 2025Updated last year
- Google's TPGST reimplementation.β34Dec 11, 2019Updated 6 years ago
- εη¬η»΄ζ€ηδΈζTTSβ34Oct 28, 2022Updated 3 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]β25Jul 5, 2022Updated 3 years ago
- Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technologyβ¦β55Nov 4, 2022Updated 3 years ago
- TTSεοΌζζ¬ζ εεοΌε°ζ°εεζ―ε€η转εδΈΊζ±εβ12Apr 27, 2024Updated 2 years ago
- β12Aug 7, 2021Updated 4 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Modelβ35Aug 27, 2023Updated 2 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTSβ65May 30, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- β37May 8, 2021Updated 5 years ago
- ICASSP2022 TTS&VC Summaryβ14Jun 9, 2022Updated 3 years ago
- The source code for the paper CrossSinger (asru2023)β18Oct 12, 2023Updated 2 years ago
- This repository contains the registries for components, agents and services, the second part of the autonolas-v1 protocol.β15May 19, 2026Updated last week
- python wrap for hts engineβ14Jan 30, 2018Updated 8 years ago
- A simple unofficial Python3 library to interface with elevenlabs.io.β17Nov 12, 2023Updated 2 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Networkβ45Dec 1, 2021Updated 4 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bertβ40Jul 10, 2023Updated 2 years ago
- Base mechβ40May 18, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- MultiSpeaker Tacotron2 using LifeLong Learning.β13Sep 27, 2019Updated 6 years ago
- Finally, some decent sample sentencesβ23Dec 3, 2023Updated 2 years ago
- Code for ICASSP 2019 paperβ18Oct 29, 2018Updated 7 years ago
- https://facetimeanyone.com/β10Nov 9, 2023Updated 2 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrogramsβ18Oct 8, 2023Updated 2 years ago
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using Γ-VAE"β44Apr 10, 2023Updated 3 years ago
- Community guides and tips for xVASynthβ17Jul 26, 2022Updated 3 years ago