crystal-zq-wang / VATT
Video Audio Translation Tool - automatically subtitles and dubs videos
☆14Updated 5 years ago
Alternatives and similar repositories for VATT:
Users that are interested in VATT are comparing it to the libraries listed below
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- A python library to find differences between audio and transcriptions☆20Updated last year
- Translated vocal synthesis - Clone a voice and output speech in another language☆25Updated 3 years ago
- Auto-Video maker handling many AI's☆10Updated last year
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆35Updated 2 years ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 4 years ago
- Powered by OpenAI Whisper & Gradio☆30Updated 2 years ago
- Site for sharing MusicGen + AudioGen Prompts and Creations☆41Updated last month
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆66Updated 2 years ago
- A simple streamlit based webapp to process text and correct punctuation built using "fullstop-punctuation-multilang-large" Model from Hug…☆11Updated last year
- Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp☆19Updated last week
- A gradio interface for making transcribed and translated subtitles for videos☆39Updated 2 months ago
- create dataset from list of youtube links easily☆17Updated 2 years ago
- Speech to text to speech using Elevenlabs☆28Updated last year
- ☆14Updated 2 years ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆13Updated 7 months ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆60Updated 7 months ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 6 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆12Updated last year
- A curated list of awesome voice activity detection☆50Updated 5 months ago
- AudioLDM text to audio colab☆19Updated last year
- Text prompt steered synthetic audio generators☆46Updated 3 weeks ago
- An easy-to-use Video generation colab notebook☆16Updated last year
- AI Video Translator / it uses ai to transcribe, translate and then reVoice a video into english in the original speakers voice☆18Updated last year
- Using Gradio interface to build UI for converting text to speech☆13Updated 4 years ago
- ☆27Updated last year
- Prepare spectrograms from audio for training a Riffusion model☆15Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago