ztsrdev / RTL-Inpainting
A pipeline focused on the in-painting of text in images. For example the removal of subtitles in a screenshot of a movie.
☆13Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for RTL-Inpainting
- A Master Thesis Project on Video Keyword Extractor using Video Summarization techniques.☆11Updated 4 years ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated 2 months ago
- Supervoice Speaker Separation Network☆13Updated 5 months ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated last year
- CLIP中文encoder☆21Updated 2 years ago
- Code for TMM paper "Horizontal-to-Vertical Video Conversion"☆14Updated 3 years ago
- The project page repo for Neural Dubber.☆27Updated last year
- repo for active speaker detection for media videos.☆20Updated 11 months ago
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆12Updated 10 months ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- Audio-Visual Lip Synthesis via Intermediate Landmark Representation☆14Updated last year
- Code for "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces" ACM MM 2023☆30Updated last year
- ☆10Updated last year
- ☆43Updated 4 months ago
- ☆12Updated 5 months ago
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Updated last year
- K-FACE Analysis Project on Pytorch☆10Updated 3 years ago
- Modify-Anything is based on yolov5,yolov8 for video and image detection. Segment-anything,lama_cleaner is applied to segment, modify, era…☆10Updated last year
- Instance-level Facial Attributes Editing (CVIU 2021)☆13Updated 2 years ago
- Talking head animation☆27Updated 11 months ago
- Some portrait matting models designed for mobile device.☆16Updated 5 years ago
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.☆21Updated 11 months ago
- Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation☆12Updated last year
- DreamDance: Personalized Text-to-video Generation by Combining Text-to-Image Synthesis and Motion Transfer☆14Updated last year
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- Controllable Face Generation via pretrained Conditional Adversarial Latent Autoencoder (ALAE)☆19Updated 4 years ago
- ☆12Updated 4 years ago