hparcells/rtvc

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hparcells/rtvc)

hparcells / rtvc

💬 "Realtime" voice transcription and cloning using ElevenLabs's API.

☆57

Alternatives and similar repositories for rtvc

Users that are interested in rtvc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lugia19 / Echo-XI
View on GitHub
Speech to text to speech using Elevenlabs
☆27Jul 2, 2023Updated 3 years ago
Slaymish / ArticulateAI
View on GitHub
Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs API's
☆14Jun 24, 2023Updated 3 years ago
winedarkmoon / ElevenGUI
View on GitHub
A user-friendly interface for ElevenLabs' API with added audio transcription capability.
☆13Jun 20, 2023Updated 3 years ago
1nnovat1on / gpt_chatbot
View on GitHub
This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses P…
☆56Dec 6, 2023Updated 2 years ago
W-Wu / ERC-SLT22
View on GitHub
Code for "Distribution-based Emotion Recognition in Conversation"
☆18Feb 6, 2023Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
benbaptist / elevenlabs
View on GitHub
A simple unofficial Python3 library to interface with elevenlabs.io.
☆17Nov 12, 2023Updated 2 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
winstxnhdw / CapGen
View on GitHub
A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.
☆11Updated this week
seahore / PPG-GradVC
View on GitHub
A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis
☆45Jul 24, 2023Updated 3 years ago
rishikksh20 / Avocodo-pytorch
View on GitHub
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
☆122Jul 14, 2022Updated 4 years ago
revsic / torch-retriever-vc
View on GitHub
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12Jan 27, 2023Updated 3 years ago
freds0 / CML-TTS-Dataset
View on GitHub
CML-TTS: A Multilingual Dataset for Speech Synthesis
☆36Jul 31, 2024Updated last year
agentitive / spellbound
View on GitHub
Autonomous and goal-seeking coding agents. 💫
☆22Dec 11, 2024Updated last year
Yangyangii / TPGST-Tacotron
View on GitHub
Google's TPGST reimplementation.
☆34Dec 11, 2019Updated 6 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
WangHelin1997 / Aty-TTS
View on GitHub
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
☆11May 14, 2025Updated last year
lucasjinreal / textfrontend
View on GitHub
单独维护的中文TTS
☆34Oct 28, 2022Updated 3 years ago
whereiswhere / iB-Writer
View on GitHub
Obsidian theme inspired by iA Writer
☆16Apr 12, 2024Updated 2 years ago
rishikksh20 / NU-Wave2-pytorch
View on GitHub
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]
☆25Jul 5, 2022Updated 4 years ago
lstrgar / ss-phoneme-seg
View on GitHub
Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…
☆55Nov 4, 2022Updated 3 years ago
dshukertjr / avatar-village
View on GitHub
☆11Aug 7, 2021Updated 4 years ago
errolyan / text_normalization_CH
View on GitHub
TTS前，文本标准化，将数字字母处理转化为汉字
☆12Apr 27, 2024Updated 2 years ago
rhoposit / multilingual_VQVAE
View on GitHub
☆37May 8, 2021Updated 5 years ago
jasonppy / syllable-discovery
View on GitHub
Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model
☆35Aug 27, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Takaaki-Saeki / zm-text-tts
View on GitHub
[IJCAI'23] Learning to Speak from Text for Low-Resource TTS
☆65May 30, 2023Updated 3 years ago
lmxue / ICASSP2022_TTS_VC_Summary
View on GitHub
ICASSP2022 TTS&VC Summary
☆13Jun 9, 2022Updated 4 years ago
zengchang233 / CrossSinger
View on GitHub
The source code for the paper CrossSinger (asru2023)
☆18Oct 12, 2023Updated 2 years ago
valory-xyz / mech
View on GitHub
Base mech
☆40Jul 8, 2026Updated 3 weeks ago
i3thuan5 / hts_engine_python
View on GitHub
python wrap for hts engine
☆14Jan 30, 2018Updated 8 years ago
liady / rl-games
View on GitHub
Games and AI in the browser! Demo site here: http://rl-games-ai.netlify.app/
☆10Oct 27, 2020Updated 5 years ago
rishikksh20 / Phone-Level-Mixture-Density-Network-for-TTS
View on GitHub
Rich Prosody Diversity Modelling with Phone-level Mixture Density Network
☆45Dec 1, 2021Updated 4 years ago
ljuvela / multiscale-GAN
View on GitHub
Code for ICASSP 2019 paper
☆18Oct 29, 2018Updated 7 years ago
stevemoraco / VideochatWithAI
View on GitHub
https://facetimeanyone.com/
☆10Nov 9, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
abdtriedcoding / noted
View on GitHub
Noted is an all-in-one workspace application, that helps you for note-making 📝, project management 📅, collaboration 👥, and more! 🛠️
☆17Nov 11, 2024Updated last year
xcmyz / Lifelong-Learning-Tacotron2
View on GitHub
MultiSpeaker Tacotron2 using LifeLong Learning.
☆13Sep 27, 2019Updated 6 years ago
hecko-yes / tts-dataset-prompts
View on GitHub
Finally, some decent sample sentences
☆24Dec 3, 2023Updated 2 years ago
ArthurBugan / nestfeed.extension
View on GitHub
Manage your Youtube and Twitter subscriptions into groups and folders
☆14May 22, 2026Updated 2 months ago
vara-tts / VARA-TTS
View on GitHub
Demo audio of VARA-TTS model
☆20Jun 11, 2021Updated 5 years ago
karchkha / MelSpec_GPT_VQVAE
View on GitHub
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18Oct 8, 2023Updated 2 years ago
ZmiennyVT / UniAI
View on GitHub
Your personal assistant who will help you with your loneliness
☆19May 4, 2023Updated 3 years ago