jxaizj / Modify-Anything
Modify-Anything is based on yolov5,yolov8 for video and image detection. Segment-anything,lama_cleaner is applied to segment, modify, erase, and obtain the target image. The background of the target image video can be changed, and the background of the image video can be changed.
☆14Updated last year
Alternatives and similar repositories for Modify-Anything:
Users that are interested in Modify-Anything are comparing it to the libraries listed below
- A sketch extractor for anime/illustration.☆19Updated 3 years ago
- Face_lib separate from AI_Power☆23Updated 7 months ago
- Thin Plate Spline Motion Model - ONNX. Extended version for FaceSwap - HeadSwap - PartSwap☆11Updated 5 months ago
- Karras et al. (2022) diffusion models for PyTorch☆17Updated last year
- Supervoice Speaker Separation Network☆12Updated 10 months ago
- Talking head animation☆27Updated last year
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆15Updated last year
- Music to Dance for 3D Avatar☆15Updated 3 years ago
- Talking Face Generation system☆19Updated last year
- ☆9Updated 4 years ago
- A skin smoothing filter to beautify faces.☆15Updated 4 years ago
- Cross-platform, customizable ML solutions for live and streaming media.☆24Updated 3 years ago
- 基于DINet的推理服务,推理视频流和视频☆15Updated last year
- A simple c++ library to detect scene transitions in a video☆14Updated 4 years ago
- A pipeline focused on the in-painting of text in images. For example the removal of subtitles in a screenshot of a movie.☆13Updated 2 years ago
- A pipeline to generate user-preferred photo-realistic avatars using stable-diffusion and bayesian-optimization.☆18Updated last month
- ☆11Updated last year
- A project about learning how to synchronize subtitles in movies using machine learning.☆9Updated 2 years ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated 7 months ago
- Make any person bald!! Component of the paper: Learning to regulate 3D head shape by removing occluding hair from in-the-wild images.☆10Updated 2 years ago
- Automatically generate a lip-synced avatar based off of a transcript and audio☆14Updated 2 years ago
- ☆9Updated 3 years ago
- ☆13Updated last year
- PyTorch implementation of paper: Paint Transformer: Feed Forward Neural Painting with Stroke Prediction, ICCV 2021.☆28Updated 3 years ago
- lightweight LAMA inference wrapper☆25Updated last year
- a naive 3d human pose editor GUI.☆19Updated last year
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Updated last year
- An application that generates images or videos using Stable Diffusion models.☆20Updated 2 years ago
- Implementing an interactive AI avatar using Python, Blender and GPT☆10Updated last year
- Project Page for VividTalk☆15Updated last year