"Realtime" voice transcription and cloning using ElevenLabs's API.
★55 | Mar 1, 2023 | Updated 3 years ago
Alternatives and similar repositories for rtvc
Users that are interested in rtvc are comparing it to the libraries listed below.
- Speech to text to speech using ElevenLabs | ★27 | Jul 2, 2023 | Updated 2 years ago
- Listen, transcribe, reply: a voice assistant using the OpenAI and ElevenLabs APIs | ★14 | Jun 24, 2023 | Updated 2 years ago
- A user-friendly interface for ElevenLabs' API with added audio transcription capability. | ★12 | Jun 20, 2023 | Updated 2 years ago
- This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses P… | ★56 | Dec 6, 2023 | Updated 2 years ago
- Code for "Distribution-based Emotion Recognition in Conversation" | ★19 | Feb 6, 2023 | Updated 3 years ago
- Project for HIDING SPEAKER'S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE | ★15 | Nov 30, 2022 | Updated 3 years ago
- Streamlit app to visualize and edit TTS datasets | ★15 | Dec 15, 2021 | Updated 4 years ago
- A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces. | ★11 | Updated this week
- A browser for your agent. | ★25 | Dec 7, 2025 | Updated 3 months ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis | ★34 | Jul 31, 2024 | Updated last year
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder | ★122 | Jul 14, 2022 | Updated 3 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation | ★12 | Jan 27, 2023 | Updated 3 years ago
- ★11 | May 7, 2022 | Updated 3 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech | ★11 | May 14, 2025 | Updated 10 months ago
- Google's TPGST reimplementation. | ★34 | Dec 11, 2019 | Updated 6 years ago
- Obsidian theme inspired by iA Writer | ★15 | Apr 12, 2024 | Updated last year
- Independently maintained Chinese TTS | ★34 | Oct 28, 2022 | Updated 3 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP] | ★25 | Jul 5, 2022 | Updated 3 years ago
- Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology… | ★55 | Nov 4, 2022 | Updated 3 years ago
- Text normalization for TTS: converts digits and letters into Chinese characters | ★12 | Apr 27, 2024 | Updated last year
- ★12 | Aug 7, 2021 | Updated 4 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model | ★34 | Aug 27, 2023 | Updated 2 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS | ★64 | May 30, 2023 | Updated 2 years ago
- ★37 | May 8, 2021 | Updated 4 years ago
- ICASSP2022 TTS&VC Summary | ★14 | Jun 9, 2022 | Updated 3 years ago
- The source code for the paper CrossSinger (asru2023) | ★18 | Oct 12, 2023 | Updated 2 years ago
- This repository contains the registries for components, agents and services, the second part of the autonolas-v1 protocol. | ★14 | Updated this week
- Python wrapper for hts engine | ★14 | Jan 30, 2018 | Updated 8 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network | ★45 | Dec 1, 2021 | Updated 4 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert | ★40 | Jul 10, 2023 | Updated 2 years ago
- Base mech | ★40 | Updated this week
- MultiSpeaker Tacotron2 using LifeLong Learning. | ★13 | Sep 27, 2019 | Updated 6 years ago
- Finally, some decent sample sentences | ★23 | Dec 3, 2023 | Updated 2 years ago
- Code for ICASSP 2019 paper | ★18 | Oct 29, 2018 | Updated 7 years ago
- Your personal assistant who will help you with your loneliness | ★19 | May 4, 2023 | Updated 2 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms | ★18 | Oct 8, 2023 | Updated 2 years ago
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using β-VAE" | ★44 | Apr 10, 2023 | Updated 2 years ago
- Manage your Youtube and Twitter subscriptions into groups and folders | ★14 | Feb 19, 2026 | Updated last month
- Demo audio of VARA-TTS model | ★20 | Jun 11, 2021 | Updated 4 years ago