Clone a voice in 5 seconds to generate arbitrary speech in real-time
β59,554Mar 9, 2026Updated 2 weeks ago
Alternatives and similar repositories for Real-Time-Voice-Cloning
Users that are interested in Real-Time-Voice-Cloning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ44,896Aug 16, 2024Updated last year
- DeepFaceLab is the leading software for creating deepfakes.β19,080Nov 13, 2024Updated last year
- πClone a voice in 5 seconds to generate arbitrary speech in real-timeβ36,891Mar 3, 2026Updated 3 weeks ago
- Deepfakes Software For Allβ55,058Updated this week
- π Text-Prompted Generative Audio Modelβ39,045Aug 19, 2024Updated last year
- Instant voice cloning by MIT and MyShell. Audio foundation model.β36,136Apr 19, 2025Updated 11 months ago
- Real-time face swap for PC streaming or video callsβ30,646Nov 8, 2024Updated last year
- A python package to analyze and compare voices with deep learningβ3,231Oct 12, 2023Updated 2 years ago
- Robust Speech Recognition via Large-Scale Weak Supervisionβ96,288Dec 15, 2025Updated 3 months ago
- Stable Diffusion web UIβ161,958Mar 2, 2026Updated 3 weeks ago
- π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal modelβ¦β158,060Updated this week
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β10,127Nov 9, 2023Updated 2 years ago
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Rasβ¦β26,737Jun 19, 2025Updated 9 months ago
- The world's simplest facial recognition api for Python and the command lineβ56,226Aug 21, 2024Updated last year
- Hunt down social media accounts by username across social networksβ73,965Updated this week
- Command-line program to download videos from YouTube.com and other video sitesβ139,931Feb 19, 2026Updated last month
- β30,533Mar 13, 2026Updated last week
- Deezer source separation library including pretrained models.β28,114Apr 2, 2025Updated 11 months ago
- Avatars for Zoom, Skype and other video-conferencing apps.β16,542Aug 30, 2024Updated last year
- A multi-voice TTS system trained with an emphasis on qualityβ14,824Nov 19, 2024Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β23,112Mar 3, 2026Updated 3 weeks ago
- AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus oβ¦β182,560Mar 18, 2026Updated last week
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multβ¦β12,885Jun 22, 2025Updated 9 months ago
- This repository contains the source code for the paper First Order Motion Model for Image Animationβ15,007Nov 14, 2024Updated last year
- A latent text-to-image diffusion modelβ72,709Jun 18, 2024Updated last year
- Making large AI models cheaper, faster and more accessibleβ41,362Mar 16, 2026Updated last week
- Tacotron 2 - PyTorch implementation with faster-than-realtime inferenceβ5,306Jun 12, 2024Updated last year
- End-to-End Speech Processing Toolkitβ9,780Updated this week
- A collective list of free APIsβ414,860Mar 18, 2026Updated last week
- An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.β109,681Updated this week
- WaveRNN Vocoder + TTSβ2,179Jul 2, 2022Updated 3 years ago
- Industry leading face manipulation platformβ27,153Updated this week
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.β77,236May 27, 2025Updated 9 months ago
- openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.β60,406Updated this week
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.β106,179Mar 18, 2026Updated last week
- SOTA Open Source TTSβ28,614Updated this week
- All Algorithms implemented in Pythonβ218,878Mar 13, 2026Updated last week
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)β56,007Feb 9, 2026Updated last month
- Build smaller, faster, and more secure desktop and mobile applications with a web frontend.β104,533Updated this week