Clone a voice in 5 seconds to generate arbitrary speech in real-time
★ 59,732 · Mar 9, 2026 · Updated 2 months ago
Alternatives and similar repositories for Real-Time-Voice-Cloning
Users who are interested in Real-Time-Voice-Cloning are comparing it to the libraries listed below.
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ★ 45,255 · Aug 16, 2024 · Updated last year
- DeepFaceLab is the leading software for creating deepfakes. ★ 19,185 · Nov 13, 2024 · Updated last year
- Clone a voice in 5 seconds to generate arbitrary speech in real-time ★ 36,903 · Mar 3, 2026 · Updated 2 months ago
- Deepfakes Software For All ★ 55,231 · May 9, 2026 · Updated last week
- Text-Prompted Generative Audio Model ★ 39,120 · Aug 19, 2024 · Updated last year
- Instant voice cloning by MIT and MyShell. Audio foundation model. ★ 36,509 · Apr 19, 2025 · Updated last year
- Real-time face swap for PC streaming or video calls ★ 30,836 · Nov 8, 2024 · Updated last year
- A python package to analyze and compare voices with deep learning ★ 3,254 · Oct 12, 2023 · Updated 2 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision ★ 99,463 · Apr 15, 2026 · Updated last month
- Stable Diffusion web UI ★ 162,812 · Mar 2, 2026 · Updated 2 months ago
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model… ★ 160,559 · Updated this week
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) ★ 10,135 · Nov 9, 2023 · Updated 2 years ago
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras… ★ 26,756 · Jun 19, 2025 · Updated 10 months ago
- The world's simplest facial recognition api for Python and the command line ★ 56,393 · Aug 21, 2024 · Updated last year
- Hunt down social media accounts by username across social networks ★ 83,157 · Updated this week
- Command-line program to download videos from YouTube.com and other video sites ★ 140,237 · Feb 19, 2026 · Updated 2 months ago
- ★ 30,488 · Mar 13, 2026 · Updated 2 months ago
- Deezer source separation library including pretrained models. ★ 28,204 · Apr 2, 2025 · Updated last year
- Avatars for Zoom, Skype and other video-conferencing apps. ★ 16,524 · Aug 30, 2024 · Updated last year
- A multi-voice TTS system trained with an emphasis on quality ★ 14,843 · Nov 19, 2024 · Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ★ 23,263 · Mar 3, 2026 · Updated 2 months ago
- AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus o… ★ 184,200 · Updated this week
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult… ★ 12,984 · Jun 22, 2025 · Updated 10 months ago
- This repository contains the source code for the paper First Order Motion Model for Image Animation ★ 15,003 · Nov 14, 2024 · Updated last year
- A latent text-to-image diffusion model ★ 72,989 · Jun 18, 2024 · Updated last year
- Making large AI models cheaper, faster and more accessible ★ 41,380 · Updated this week
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference ★ 5,302 · Jun 12, 2024 · Updated last year
- End-to-End Speech Processing Toolkit ★ 9,836 · Updated this week
- A collective list of free APIs ★ 433,971 · Updated this week
- An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer. ★ 114,156 · Updated this week
- WaveRNN Vocoder + TTS ★ 2,182 · Jul 2, 2022 · Updated 3 years ago
- Industry leading face manipulation platform ★ 28,236 · Updated this week
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use. ★ 77,364 · May 27, 2025 · Updated 11 months ago
- openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars. ★ 60,889 · Updated this week
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. ★ 112,559 · Updated this week
- All Algorithms implemented in Python ★ 221,021 · May 4, 2026 · Updated last week
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) ★ 57,341 · Apr 30, 2026 · Updated 2 weeks ago
- SOTA Open Source TTS ★ 30,356 · Updated this week
- Build smaller, faster, and more secure desktop and mobile applications with a web frontend. ★ 106,447 · Updated this week