crystal-zq-wang / VATT
Video Audio Translation Tool - automatically subtitles and dubs videos
☆14Updated 5 years ago
Alternatives and similar repositories for VATT:
Users that are interested in VATT are comparing it to the libraries listed below
- Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp☆17Updated last month
- A python library to find differences between audio and transcriptions☆17Updated last year
- A project about learning how to synchronize subtitles in movies using machine learning.☆9Updated 2 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- An experimental proof-of-concept script to automatically dub videos to English with the help of local TTS, voice cloning, audio separatio…☆12Updated 10 months ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆66Updated 2 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- List of repositories relevant to VITS.☆36Updated 2 years ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- Voxella 🌍 - AI Video Translation and Dubbing App: Seamlessly translate and dub videos into multiple languages with Voxella. This powerfu…☆16Updated last year
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆34Updated 2 years ago
- Text To Speech Multilingual Support (+20 Language)☆42Updated last year
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated 6 months ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆22Updated 8 months ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 3 years ago
- A simple streamlit based webapp to process text and correct punctuation built using "fullstop-punctuation-multilang-large" Model from Hug…☆11Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- Educational voice conversation partner using Chat-GPT, Whisper, and AWS Polly.☆14Updated last year
- AudioLDM text to audio colab☆19Updated last year
- Translated vocal synthesis - Clone a voice and output speech in another language☆24Updated 2 years ago
- Powered by OpenAI Whisper & Gradio☆30Updated 2 years ago
- ☆56Updated 9 months ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆59Updated 5 months ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated last year
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- A curated list of awesome voice activity detection☆44Updated 4 months ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆35Updated 2 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆60Updated 2 weeks ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆30Updated 10 months ago