imanousar / Automatic-Subtitles-SynchronizationLinks

A project about learning how to synchronize subtitles in movies using machine learning.

☆9

Alternatives and similar repositories for Automatic-Subtitles-Synchronization

Users that are interested in Automatic-Subtitles-Synchronization are comparing it to the libraries listed below

Sorting:

pnkvalavala / multivoice
Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …
☆26Updated last year
flozi00 / atra
An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …
☆20Updated 9 months ago
umutseven92 / LaFontaine
An automatic movie trailer generator.
☆41Updated 2 years ago
aflorithmic / viseme-to-video
Creates video from TTS output and viseme images.
☆12Updated 3 years ago
ras0k / auto-lyrics
Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp
☆20Updated 2 months ago
SELMA-project / ml4audio
audio, NLP, ML with huggingface, nvidia/nemo, speechbrain
☆11Updated last year
prateekralhan / OpenAI_Whisper_ASR
A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models
☆66Updated 2 years ago
Many0therFunctions / MaskGCT-Text-To-Semantic-Finetune
This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …
☆12Updated 7 months ago
EliasVincent / whisper-subtitles-webui
A gradio interface for making transcribed and translated subtitles for videos
☆42Updated 4 months ago
rsxdalv / musicgen-prompts
Site for sharing MusicGen + AudioGen Prompts and Creations
☆45Updated 3 months ago
ddPn08 / Latopia
Speech AI training and inference tools
☆36Updated 2 years ago
uberduck-ai / dataset_viewer
Streamlit app to visualize and edit TTS datasets
☆14Updated 3 years ago
EternalDusk / LipSyncVideoGenerator
Automatically generate a lip-synced avatar based off of a transcript and audio
☆13Updated 2 years ago
Mildemelwe / Non-English-Tacotron-2-Training-Notebook
Tacotron 2 training notebook supporting Japanese, French, and Mandarin
☆11Updated 2 years ago
kadirnar / codeformer-pip
Towards Robust Blind Face Restoration with Codebook Lookup Transformer
☆30Updated last year
kadirnar / Video-Diffusion-WebUI
Video Diffusion WebUI: Text2Video + Image2Video + Video2Video WebUI
☆67Updated last year
amrrs / ai-music-video
Ai generated music video with Riffusion and Gradio
☆21Updated 2 years ago
ex3ndr / supervoice-separate
Supervoice Speaker Separation Network
☆12Updated last year
nishgowda / autocutpro
Autonomous video editing powered by Computer Vision and Motion Detection
☆17Updated last year
qiye45 / Bert-VITS2_easy_training
简化Bert-VITS2模型训练
☆9Updated last year
austin-bowen / voicebox
Python text-to-speech library with built-in voice effects and support for multiple TTS engines
☆23Updated 3 months ago
souvikg544 / TTS_Data_Maker
Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…
☆28Updated 2 years ago
smaybius / Coqui-TTS-GUI-solution
Interface for using TTS and vocoder models in the form of a text editor
☆19Updated 2 years ago
coqui-ai / stt-model-manager
Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo
☆26Updated 2 years ago
thorstenMueller / cTTS
TTS Client for Coqui TTS server
☆13Updated 2 years ago
Verssae / flask-tacotron2-tts-web-app
flask+tornado based NVIDIA tacotron2+waveglow tts web app
☆29Updated 2 years ago
ryanrudes / YTTTS
The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions
☆51Updated 4 years ago
harmsm / pyfx
Python library for adding visual effects to video streams
☆12Updated 5 years ago
youmebangbang / TTS-dataset-tools
Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…
☆52Updated 3 years ago
TylorShine / MNP-SVC
Real-time end-to-end singing voice convertion
☆22Updated 8 months ago