pnkvalavala / multivoice
Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning and TTS to deliver natural and engaging dubbed dialogue for a seamless viewing adventure.
☆26Updated last year
Alternatives and similar repositories for multivoice
Users that are interested in multivoice are comparing it to the libraries listed below
Sorting:
- A simple script to prepare dataset for training with TTS Tortoise model via https://git.ecker.tech/mrq/ai-voice-cloning☆12Updated last year
- AudioLDM text to audio colab☆19Updated last year
- Translated vocal synthesis - Clone a voice and output speech in another language☆25Updated 3 years ago
- Advanced RVC Inference for quicker and effortless model downloads☆51Updated last month
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)☆23Updated last month
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆16Updated last week
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆61Updated 7 months ago
- Misc. tools/scripts that I made to use for tortoise☆21Updated 8 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 6 months ago
- A project about learning how to synchronize subtitles in movies using machine learning.☆9Updated 2 years ago
- Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp☆19Updated 2 weeks ago
- Auto-Video maker handling many AI's☆10Updated last year
- Sophia AI Assistant is a Python-based desktop AI that performs a variety of tasks, including answering questions, opening applications, b…☆16Updated 6 months ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆35Updated 2 years ago
- ☆27Updated last year
- ☆83Updated 10 months ago
- Google Colab (without Gradio) notebook for generating AI song covers. YouTube download audio, best voice separation, RVC inference, autom…☆17Updated last year
- Create storybooks generated using generative AI models from using LLMs for text to Stable Diffusion for illustrations (maybe also use tex…☆20Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆67Updated last year
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year
- an improved version of Real-time-voice-cloning☆50Updated last year
- Site for sharing MusicGen + AudioGen Prompts and Creations☆42Updated last month
- ☆54Updated last year
- Automatically generate a lip-synced avatar based off of a transcript and audio☆13Updated 2 years ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆70Updated 10 months ago
- Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs API's☆14Updated last year
- Video Diffusion WebUI: Text2Video + Image2Video + Video2Video WebUI☆67Updated last year
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆13Updated 7 months ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- llmon-py is a multimodal webui for Llama 3-8B.☆16Updated 10 months ago