CorentinJ / Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆58,886 · Updated 2 months ago
Alternatives and similar repositories for Real-Time-Voice-Cloning
Users who are interested in Real-Time-Voice-Cloning are comparing it to the libraries listed below.
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆43,540 · Updated last year
- 🚀 AI voice mimicry: clone your voice within 5 seconds and generate arbitrary speech content. Clone a voice in 5 seconds to generate arbitrary speech in real-time ☆36,775 · Updated 2 weeks ago
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) ☆10,054 · Updated 2 years ago
- DeepFaceLab is the leading software for creating deepfakes. ☆18,819 · Updated last year
- Deepfakes Software For All ☆54,747 · Updated this week
- A python package to analyze and compare voices with deep learning ☆3,156 · Updated 2 years ago
- Instant voice cloning by MIT and MyShell. Audio foundation model. ☆35,478 · Updated 7 months ago
- Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text fronten… ☆12,377 · Updated last month
- End-to-End Speech Processing Toolkit ☆9,603 · Updated this week
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work! ☆40,594 · Updated this week
- Real-time face swap for PC streaming or video calls ☆30,153 · Updated last year
- Deezer source separation library including pretrained models. ☆27,795 · Updated 7 months ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference ☆5,284 · Updated last year
- Industry leading face manipulation platform ☆25,894 · Updated this week
- This repository contains the source code for the paper First Order Motion Model for Image Animation ☆14,974 · Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision ☆91,194 · Updated 2 months ago
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto… ☆16,141 · Updated this week
- A multi-voice TTS system trained with an emphasis on quality ☆14,704 · Updated last year
- one-click face swap ☆30,374 · Updated last year
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras… ☆26,662 · Updated 5 months ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germa… ☆3,987 · Updated last year
- Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML! ☆9,157 · Updated last year
- Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creat… ☆26,285 · Updated last week
- LLM inference in C/C++ ☆90,119 · Updated last week
- A latent text-to-image diffusion model ☆71,863 · Updated last year
- DALL·E Mini - Generate images from a text prompt ☆14,814 · Updated 2 years ago
- WaveRNN Vocoder + TTS ☆2,174 · Updated 3 years ago
- Label Studio is a multi-type data labeling and annotation tool with standardized output format ☆25,496 · Updated last week
- Rembg is a tool to remove image backgrounds ☆21,091 · Updated last week
- Background Matting: The World is Your Green Screen ☆4,785 · Updated 3 years ago