jxaizj / Modify-AnythingLinks
Modify-Anything is based on yolov5,yolov8 for video and image detection. Segment-anything,lama_cleaner is applied to segment, modify, erase, and obtain the target image. The background of the target image video can be changed, and the background of the image video can be changed.
☆15Updated 2 years ago
Alternatives and similar repositories for Modify-Anything
Users that are interested in Modify-Anything are comparing it to the libraries listed below
Sorting:
- An application that generates images or videos using Stable Diffusion models.☆20Updated 2 years ago
- Automatically generate a lip-synced avatar based off of a transcript and audio☆13Updated 2 years ago
- Talking Face Generation system☆19Updated last year
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆15Updated last year
- A pipeline to generate user-preferred photo-realistic avatars using stable-diffusion and bayesian-optimization.☆18Updated 2 months ago
- Talking head animation☆27Updated last year
- Cross-platform, customizable ML solutions for live and streaming media.☆24Updated 4 years ago
- A sketch extractor for anime/illustration.☆19Updated 3 years ago
- ☆18Updated last year
- Faceprecision is a comprehensive face analysis project leveraging advanced deep learning and computer vision techniques. This project inc…☆14Updated 11 months ago
- Unofficial pytorch implementation of TryOnGAN☆18Updated 3 years ago
- "Make-A-Video", new SOTA text to video by Meta-FAIR - Tensorflow☆14Updated 2 years ago
- Retrieval Augmented Generation for youtube videos with a BRAD agent☆32Updated 6 months ago
- 基于DINet的推理服务,推理视频流和视频☆16Updated last year
- DoyenTalker uses deep learning techniques to generate personalized avatar videos that speak user-provided text in a specified voice. The …☆12Updated 10 months ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆15Updated 4 years ago
- Taking advantage of LlamaIndex's in-context learning paradigm, LlamaDoc empowers users to input PDF documents and pose any questions rela…☆14Updated 2 years ago
- AI_Video_Shorts_Creator is a python-based tool that uses OpenAI's GPT-4 power to automatically analyze videos, extract the most interesti…☆19Updated last year
- A skin smoothing filter to beautify faces.☆15Updated 4 years ago
- Style Transfer a face into cartoon without GAN. A UNet++ network with MobileNet v3 backbone optimized for mobile frameworks☆30Updated 3 years ago
- Query, ask and chat with a document-index via transformer models!☆17Updated 2 years ago
- ☆16Updated last year
- 【AIGC 实战入门笔记 —— AIGC 摩天大楼】分享 大语言模型(LLMs),大模型高效微调(SFT),检索增强生成(RAG),智能体(Agent),PPT自动生成, 角色扮演,文生图(Stable Diffusion) ,图像文字识别(OCR),语音识别(ASR),语…☆20Updated 3 months ago
- ☆15Updated last year
- Make any person bald!! Component of the paper: Learning to regulate 3D head shape by removing occluding hair from in-the-wild images.☆11Updated 3 years ago
- This project is under development.☆23Updated last year
- ImageSlider custom component for gradio.☆42Updated last year
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Updated last year
- PyTorch implementation of paper: Paint Transformer: Feed Forward Neural Painting with Stroke Prediction, ICCV 2021.☆28Updated 3 years ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆12Updated last year