pranauv1 / AI-Video-TranslationView external linksLinks
A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.
☆252Mar 12, 2025Updated 11 months ago
Alternatives and similar repositories for AI-Video-Translation
Users that are interested in AI-Video-Translation are comparing it to the libraries listed below
Sorting:
- ☆12Mar 18, 2024Updated last year
- ☆16Sep 30, 2023Updated 2 years ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- ☆12Mar 25, 2024Updated last year
- ☆14Nov 22, 2024Updated last year
- ☆25Mar 30, 2025Updated 10 months ago
- ☆14May 31, 2024Updated last year
- ☆15Jun 25, 2024Updated last year
- The repo for: TriHuman: A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis☆19Nov 15, 2025Updated 2 months ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆22Updated this week
- This repo contains self made projects and learnables from various resources on using local LLMs and RAG☆14May 26, 2025Updated 8 months ago
- ☆15Mar 12, 2024Updated last year
- Jupyter notebooks for PuLID face transfer with Flux.1 dev. Able to run on Google Colab Free Tier☆18Dec 18, 2024Updated last year
- ☆20Dec 19, 2023Updated 2 years ago
- ☆24Dec 10, 2023Updated 2 years ago
- ☆22Aug 31, 2024Updated last year
- ☆19Jan 15, 2024Updated 2 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆22Dec 5, 2022Updated 3 years ago
- Experimental method to use reference video to drive motion in generations without training in ComfyUI.☆37Apr 9, 2024Updated last year
- ☆33Feb 26, 2024Updated last year
- A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS☆54Dec 11, 2024Updated last year
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast…☆51Updated this week
- ☆18Jan 16, 2024Updated 2 years ago
- Animefy: ComfyUI workflow designed to convert images or videos into an anime-like style automatically.☆22Jul 2, 2024Updated last year
- A repo for generating random NFTs with metadata 100% on chain!☆37Mar 8, 2024Updated last year
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆19May 12, 2023Updated 2 years ago
- Implementation of TTS model based on NVIDIA P-Flow TTS Paper☆77May 12, 2024Updated last year
- Website source code for our ACM MM'23 paper "Hierarchical Masked 3D Diffusion Model for Video Outpainting".☆41Apr 20, 2024Updated last year
- ☆25Dec 22, 2023Updated 2 years ago
- ☆11Aug 11, 2023Updated 2 years ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- ⭐ EASY TO USE ⭐ A yt-dlp ▶️ based Tiktok ♪ scraper - more cleanly explained☆11Feb 16, 2023Updated 2 years ago
- A collection of all our phonemeizers for dataset construction and inference☆27Feb 21, 2025Updated 11 months ago
- Reimplementation of Miipher☆29Aug 16, 2023Updated 2 years ago
- Repository with which to explore k-diffusion and diffusers, and within which changes to said packages may be tested.☆55Jan 28, 2024Updated 2 years ago
- [ECCV 2024] PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance☆23Jul 25, 2024Updated last year
- SOFA_AI: Singing-Oriented Forced Aligner for Automatic Inference☆24May 28, 2024Updated last year
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- Codebase for the paper HawkI: HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View☆13Jun 5, 2024Updated last year