"Realtime" voice transcription and cloning using the ElevenLabs API.
★55 · Mar 1, 2023 · Updated 3 years ago
Alternatives and similar repositories for rtvc
Users who are interested in rtvc are comparing it to the libraries listed below.
- Speech to text to speech using Elevenlabs ★27 · Jul 2, 2023 · Updated 2 years ago
- Listen, transcribe, reply: a voice assistant using the OpenAI and ElevenLabs APIs ★14 · Jun 24, 2023 · Updated 2 years ago
- Code for "Distribution-based Emotion Recognition in Conversation" ★19 · Feb 6, 2023 · Updated 3 years ago
- Streamlit app to visualize and edit TTS datasets ★15 · Dec 15, 2021 · Updated 4 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation ★12 · Jan 27, 2023 · Updated 3 years ago
- Project for HIDING SPEAKER'S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE ★15 · Nov 30, 2022 · Updated 3 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis ★33 · Jul 31, 2024 · Updated last year
- A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces. ★11 · Updated this week
- TTS text normalization: converts digits and letters into Chinese characters ★12 · Apr 27, 2024 · Updated last year
- ★14 · Aug 16, 2023 · Updated 2 years ago
- ★11 · May 7, 2022 · Updated 3 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model ★34 · Aug 27, 2023 · Updated 2 years ago
- Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688 ★12 · Dec 2, 2024 · Updated last year
- A user-friendly interface for ElevenLabs' API with added audio transcription capability. ★12 · Jun 20, 2023 · Updated 2 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder ★122 · Jul 14, 2022 · Updated 3 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech ★11 · May 14, 2025 · Updated 9 months ago
- Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology… ★55 · Nov 4, 2022 · Updated 3 years ago
- ICASSP2022 TTS&VC Summary ★14 · Jun 9, 2022 · Updated 3 years ago
- A transform to show the latest copy of the website from the Wayback Machine ★17 · Nov 25, 2014 · Updated 11 years ago
- python wrap for hts engine ★14 · Jan 30, 2018 · Updated 8 years ago
- A simple unofficial Python3 library to interface with elevenlabs.io. ★17 · Nov 12, 2023 · Updated 2 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ★64 · May 30, 2023 · Updated 2 years ago
- Independently maintained Chinese TTS ★34 · Oct 28, 2022 · Updated 3 years ago
- MultiSpeaker Tacotron2 using LifeLong Learning. ★13 · Sep 27, 2019 · Updated 6 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms ★18 · Oct 8, 2023 · Updated 2 years ago
- CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch) ★59 · Jul 26, 2022 · Updated 3 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert ★40 · Jul 10, 2023 · Updated 2 years ago
- Google's TPGST reimplementation. ★34 · Dec 11, 2019 · Updated 6 years ago
- wake-up word emotion recognition [APSIPA 2022] ★17 · Nov 11, 2022 · Updated 3 years ago
- Obsidian theme inspired by iA Writer ★15 · Apr 12, 2024 · Updated last year
- The source code for the paper CrossSinger (asru2023) ★18 · Oct 12, 2023 · Updated 2 years ago
- Some scripts to help with using Montreal Forced Aligner, mainly for transforming Hanzi characters to pinyin and extracting pause times from .textg… ★14 · Feb 9, 2024 · Updated 2 years ago
- ★37 · May 8, 2021 · Updated 4 years ago
- ★18 · Dec 7, 2023 · Updated 2 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022) ★22 · Dec 5, 2022 · Updated 3 years ago
- Code for ICASSP 2019 paper ★18 · Oct 29, 2018 · Updated 7 years ago
- Chat with an AI simulation of anyone as easily as copy-pasting text into a folder! ★19 · Mar 4, 2023 · Updated 3 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network ★45 · Dec 1, 2021 · Updated 4 years ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis ★44 · Jul 24, 2023 · Updated 2 years ago