hparcells / rtvcView external linksLinks
π¬ "Realtime" voice transcription and cloning using ElevenLabs's API.
β54Mar 1, 2023Updated 2 years ago
Alternatives and similar repositories for rtvc
Users that are interested in rtvc are comparing it to the libraries listed below
Sorting:
- Speech to text to speech using Elevenlabsβ28Jul 2, 2023Updated 2 years ago
- Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs API'sβ14Jun 24, 2023Updated 2 years ago
- Code for "Distribution-based Emotion Recognition in Conversation"β19Feb 6, 2023Updated 3 years ago
- This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses Pβ¦β56Dec 6, 2023Updated 2 years ago
- Streamlit app to visualize and edit TTS datasetsβ15Dec 15, 2021Updated 4 years ago
- Project for HIDING SPEAKERβS SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINEβ15Nov 30, 2022Updated 3 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representationβ12Jan 27, 2023Updated 3 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesisβ33Jul 31, 2024Updated last year
- β14Aug 16, 2023Updated 2 years ago
- TTSεοΌζζ¬ζ εεοΌε°ζ°εεζ―ε€η转εδΈΊζ±εβ12Apr 27, 2024Updated last year
- β11May 7, 2022Updated 3 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Modelβ34Aug 27, 2023Updated 2 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoderβ122Jul 14, 2022Updated 3 years ago
- Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688β12Dec 2, 2024Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speechβ11May 14, 2025Updated 9 months ago
- Phoneme segmentation using pre-trained speech modelsβ55Nov 4, 2022Updated 3 years ago
- ICASSP2022 TTS&VC Summaryβ14Jun 9, 2022Updated 3 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTSβ64May 30, 2023Updated 2 years ago
- python wrap for hts engineβ14Jan 30, 2018Updated 8 years ago
- A simple unofficial Python3 library to interface with elevenlabs.io.β17Nov 12, 2023Updated 2 years ago
- A transform to show the latest copy of the website from the Wayback Machineβ17Nov 25, 2014Updated 11 years ago
- εη¬η»΄ζ€ηδΈζTTSβ34Oct 28, 2022Updated 3 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrogramsβ18Oct 8, 2023Updated 2 years ago
- CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)β59Jul 26, 2022Updated 3 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bertβ40Jul 10, 2023Updated 2 years ago
- Google's TPGST reimplementation.β34Dec 11, 2019Updated 6 years ago
- wake-up word emotion recognition [APSIPA 2022]β17Nov 11, 2022Updated 3 years ago
- The source code for the paper CrossSinger (asru2023)β18Oct 12, 2023Updated 2 years ago
- Obsidian theme inspired by iA Writerβ15Apr 12, 2024Updated last year
- Some script for helping using Montreal Forced Aligner, maily for transforming Hanzi character to pinyin and extrat pause time from .textgβ¦β14Feb 9, 2024Updated 2 years ago
- β37May 8, 2021Updated 4 years ago
- A browser for your agent.β23Dec 7, 2025Updated 2 months ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)β22Dec 5, 2022Updated 3 years ago
- Chat with an AI simulation of anyone as easily as copy-pasting text into a folder!β18Mar 4, 2023Updated 2 years ago
- β18Dec 7, 2023Updated 2 years ago
- Code for ICASSP 2019 paperβ18Oct 29, 2018Updated 7 years ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesisβ44Jul 24, 2023Updated 2 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Networkβ45Dec 1, 2021Updated 4 years ago
- The official implementation of OpenSR (ACL2023 Oral)β16Nov 29, 2023Updated 2 years ago