crystal-zq-wang / VATT
Video Audio Translation Tool - automatically subtitles and dubs videos
☆13Updated 5 years ago
Alternatives and similar repositories for VATT
Users who are interested in VATT are comparing it to the repositories listed below
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated 2 years ago
- AI Talking Head: create video from plain text or an audio file in minutes; supports 100+ languages and 350+ voice models.☆36Updated 2 years ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆69Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the…☆47Updated 2 years ago
- A python library to find differences between audio and transcriptions☆19Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- A minimalistic automatic speech recognition Streamlit-based web app powered by OpenAI's Whisper "State of the Art" models☆65Updated 3 years ago
- A curated list of awesome OpenAI's Whisper☆98Updated 2 years ago
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)☆20Updated 5 years ago
- Repo for active speaker detection in media videos.☆29Updated last year
- Translated vocal synthesis - Clone a voice and output speech in another language☆26Updated 3 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆34Updated last year
- AutomEditor is an AI-based video editor that helps video bloggers remove bloopers automatically. It uses multimodal spatio-temporal bl…☆47Updated 6 years ago
- Multilingual text-to-speech support (20+ languages)☆50Updated 2 years ago
- Engage in conversation with your virtual self using AI techniques like NLP, voice cloning, and computer vision. Get accurate answers with…☆84Updated 2 years ago
- Transferability of cross-lingual and cross-age speech emotion recognition☆19Updated 2 years ago
- Real time multilingual face translator☆38Updated 2 months ago
- A testing repo to share code and thoughts on diarisation☆56Updated last year
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆13Updated 2 years ago
- Using Gradio interface to build UI for converting text to speech☆13Updated 4 years ago
- Automatically generate, translate and overlay subtitles for any video.☆44Updated last month
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated last year
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Updated 2 years ago
- Generate subtitles for long movies / podcasts with OpenAI Whisper API.☆31Updated 2 years ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated last year
- Unsupervised video dubbing project☆40Updated 5 years ago
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…☆39Updated 6 months ago
- Repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆15Updated 5 years ago