imanousar / Automatic-Subtitles-Synchronization
A project about learning how to synchronize subtitles in movies using machine learning.
☆9 · Updated 2 years ago
Alternatives and similar repositories for Automatic-Subtitles-Synchronization:
Users who are interested in Automatic-Subtitles-Synchronization are comparing it to the libraries listed below:
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning … ☆26 · Updated last year
- An automatic movie trailer generator. ☆41 · Updated 2 years ago
- A gradio interface for making transcribed and translated subtitles for videos ☆39 · Updated 2 months ago
- Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp ☆19 · Updated last week
- Creates video from TTS output and viseme images. ☆11 · Updated 2 years ago
- Automatically generate a lip-synced avatar based off of a transcript and audio ☆14 · Updated 2 years ago
- Streamlit app to visualize and edit TTS datasets ☆14 · Updated 3 years ago
- Video Audio Translation Tool - automatically subtitles and dubs videos ☆14 · Updated 5 years ago
- canvas-based talking head model using viseme data ☆31 · Updated last year
- Automatically generate a music video by extracting scenes from another video ☆31 · Updated last year
- Powered by OpenAI Whisper & Gradio ☆30 · Updated 2 years ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. … ☆12 · Updated 5 months ago
- Python ffmpeg wrapper for audio and video editing (trim, subtitles/overlay, concat, merge, & more!) ☆23 · Updated 5 years ago
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models… ☆18 · Updated 2 years ago
- Autonomous video editing powered by Computer Vision and Motion Detection ☆17 · Updated last year
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models. ☆35 · Updated 2 years ago
- SeamlessM4t-Translator: Utilizing the powerful Seamless M4t Facebook model in the backend, this project facilitates seamless translation … ☆12 · Updated last year
- TTS Client for Coqui TTS server ☆13 · Updated 2 years ago
- audio, NLP, ML with huggingface, nvidia/nemo, speechbrain ☆11 · Updated last year
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model. ☆14 · Updated last week
- Python Audio Separator in Real Time using MDX-NET model ☆21 · Updated last year
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place. ☆69 · Updated 10 months ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation ☆43 · Updated last year
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio… ☆13 · Updated 4 years ago
- Interface for using TTS and vocoder models in the form of a text editor ☆20 · Updated 2 years ago
- Cloned repository from Hugging Face Spaces (CVPR 2022 Demo) ☆54 · Updated 2 years ago
- Finally, some decent sample sentences ☆22 · Updated last year
- Allows you to edit videos automatically using Motion Detection ☆31 · Updated 4 years ago
- Website for generating subtitles for videos using OpenAI's Whisper Models ☆11 · Updated 8 months ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models ☆66 · Updated 2 years ago