Clone a voice in 5 seconds to generate arbitrary speech in real-time
★59,868 · Mar 9, 2026 · Updated 2 months ago
Alternatives and similar repositories for Real-Time-Voice-Cloning
Users who are interested in Real-Time-Voice-Cloning are comparing it to the libraries listed below.
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production · ★45,462 · Aug 16, 2024 · Updated last year
- DeepFaceLab is the leading software for creating deepfakes. · ★19,224 · Nov 13, 2024 · Updated last year
- Clone a voice in 5 seconds to generate arbitrary speech in real-time · ★36,898 · Mar 3, 2026 · Updated 3 months ago
- Deepfakes Software For All · ★55,250 · May 29, 2026 · Updated last week
- Text-Prompted Generative Audio Model · ★39,145 · Aug 19, 2024 · Updated last year
- Instant voice cloning by MIT and MyShell. Audio foundation model. · ★36,596 · Apr 19, 2025 · Updated last year
- Real-time face swap for PC streaming or video calls · ★30,888 · Nov 8, 2024 · Updated last year
- A python package to analyze and compare voices with deep learning · ★3,262 · Oct 12, 2023 · Updated 2 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision · ★100,922 · Apr 15, 2026 · Updated last month
- Stable Diffusion web UI · ★163,404 · Mar 2, 2026 · Updated 3 months ago
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model… · ★161,309 · Updated this week
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) · ★10,146 · Nov 9, 2023 · Updated 2 years ago
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras… · ★26,750 · Jun 19, 2025 · Updated 11 months ago
- The world's simplest facial recognition api for Python and the command line · ★56,451 · Aug 21, 2024 · Updated last year
- Hunt down social media accounts by username across social networks · ★84,372 · Updated this week
- Command-line program to download videos from YouTube.com and other video sites · ★140,409 · Feb 19, 2026 · Updated 3 months ago
- Deezer source separation library including pretrained models. · ★28,232 · Apr 2, 2025 · Updated last year
- Avatars for Zoom, Skype and other video-conferencing apps. · ★16,516 · Aug 30, 2024 · Updated last year
- A multi-voice TTS system trained with an emphasis on quality · ★14,852 · Nov 19, 2024 · Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… · ★23,337 · Mar 3, 2026 · Updated 3 months ago
- AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus o… · ★184,750 · Updated this week
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult… · ★13,017 · Jun 22, 2025 · Updated 11 months ago
- This repository contains the source code for the paper First Order Motion Model for Image Animation · ★15,001 · Nov 14, 2024 · Updated last year
- A latent text-to-image diffusion model · ★73,078 · Jun 18, 2024 · Updated last year
- Making large AI models cheaper, faster and more accessible · ★41,385 · May 25, 2026 · Updated last week
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference · ★5,303 · Jun 12, 2024 · Updated last year
- End-to-End Speech Processing Toolkit · ★9,847 · May 30, 2026 · Updated last week
- A collective list of free APIs · ★439,010 · Updated this week
- An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer. · ★115,218 · Updated this week
- WaveRNN Vocoder + TTS · ★2,185 · Jul 2, 2022 · Updated 3 years ago
- Industry leading face manipulation platform · ★28,589 · May 29, 2026 · Updated last week
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use. · ★77,354 · May 27, 2025 · Updated last year
- openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars. · ★61,245 · Updated this week
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. · ★115,665 · Updated this week
- All Algorithms implemented in Python · ★221,469 · May 22, 2026 · Updated 2 weeks ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) · ★58,289 · Apr 30, 2026 · Updated last month
- SOTA Open Source TTS · ★30,620 · May 26, 2026 · Updated last week
- Build smaller, faster, and more secure desktop and mobile applications with a web frontend. · ★107,414 · Updated this week
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. · ★32,229 · Sep 30, 2025 · Updated 8 months ago