crystal-zq-wang / VATT
Video Audio Translation Tool - automatically subtitles and dubs videos
☆13 Updated 5 years ago
Alternatives and similar repositories for VATT
Users that are interested in VATT are comparing it to the libraries listed below
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models ☆67 Updated 3 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning … ☆26 Updated 2 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the… ☆47 Updated 3 years ago
- ☆58 Updated last year
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions ☆52 Updated 4 years ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time. ☆73 Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆25 Updated 2 years ago
- Site for sharing MusicGen + AudioGen Prompts and Creations ☆48 Updated 9 months ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models. ☆36 Updated 3 years ago
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai… ☆42 Updated 9 months ago
- Real time multilingual face translator ☆38 Updated 5 months ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ☆34 Updated 5 years ago
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines … ☆62 Updated 4 years ago
- Codebase and project page for EDMSound ☆35 Updated 2 years ago
- Colaboratory Notebook for Ultimate Vocal Remover ☆99 Updated last year
- Zero-Shot Foreign Accent Conversion without a Native Reference ☆36 Updated last year
- An unofficial PyTorch implementation of VALL-E ☆88 Updated 5 months ago
- Text prompt steered synthetic audio generators ☆52 Updated 8 months ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. … ☆13 Updated last year
- Streamlit app to visualize and edit TTS datasets ☆15 Updated 4 years ago
- Autonomous video editing powered by Computer Vision and Motion Detection ☆17 Updated 2 years ago
- A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup. ☆76 Updated last year
- ☆44 Updated last year
- GPT for FACodec ☆13 Updated last year
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l… ☆25 Updated last year
- Implementation of Google's USM speech model in Pytorch ☆34 Updated 2 months ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems. ☆40 Updated last week
- A simple voice conversion tool ☆19 Updated 3 years ago
- GPT-style network for phonemization with durations of text ☆68 Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆30 Updated 2 years ago