eren23 / sam-clip-diffusionLinks
SAM + CLIP + DIFFUSION for image to edit objects in images using plain text
☆15Updated 2 years ago
Alternatives and similar repositories for sam-clip-diffusion
Users that are interested in sam-clip-diffusion are comparing it to the libraries listed below
Sorting:
- This project is under development.☆23Updated 2 years ago
- A pipeline to generate user-preferred photo-realistic avatars using stable-diffusion and bayesian-optimization.☆18Updated 7 months ago
- Unofficial pytorch implementation of TryOnGAN☆18Updated 4 years ago
- Diffusion WebUI: Stable Diffusion + ControlNet + Inpaint☆52Updated 2 years ago
- Faysal-MD / Unmasking-Deepfake-Faces-from-Videos-An-Explainable-Cost-Sensitive-Deep-Learning-Approach-IEEE2023Deepfake faces detection from forged videos where used explainable AI for models' robustness as well as cost sensitive methods for mitiga…☆10Updated last year
- Stable Fashion: A prompt based virtual try on repository☆89Updated 3 years ago
- Modify-Anything is based on yolov5,yolov8 for video and image detection. Segment-anything,lama_cleaner is applied to segment, modify, era…☆17Updated 2 years ago
- Photorealism model use RealVisXL v4.0☆12Updated last year
- ☆15Updated 2 years ago
- A Python neural network made with TensorFlow that converts one person's voice into another.☆10Updated 4 years ago
- We archive data because we are interested in the diffs. All data is from https://video-api.cartoonnetwork.com. We run the check every min…☆10Updated this week
- ☆29Updated 2 years ago
- ☆15Updated 2 years ago
- Fine-tune and quantize Llama-2-like models to generate Python code using QLoRA, Axolot,..☆64Updated last year
- Demo example of consumer goods categorization☆30Updated 2 years ago
- Implementation of Grounding DINO & Segment Anything, and it allows masking based on prompt, which is useful for programmed inpainting.☆37Updated 2 years ago
- Style Transfer a face into cartoon without GAN. A UNet++ network with MobileNet v3 backbone optimized for mobile frameworks☆30Updated 3 years ago
- ☆51Updated 3 years ago
- DocQues answers queries on longer and multiple documents build on GPT-Index and GPT-3☆13Updated 3 years ago
- Document Summarization App using large language model (LLM) and Langchain framework. Used a pre-trained T5 model and its tokenizer from H…☆13Updated 2 years ago
- Automatically generate a lip-synced avatar based off of a transcript and audio☆14Updated 2 years ago
- ☆78Updated 2 years ago
- ☆30Updated 2 years ago
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated 2 years ago
- DoyenTalker uses deep learning techniques to generate personalized avatar videos that speak user-provided text in a specified voice. The …☆13Updated last year
- Image captioning using python and BLIP☆50Updated 2 years ago
- Finetune any model on HF in less than 30 seconds☆56Updated 2 months ago
- Composition of Multimodal Language Models From Scratch☆15Updated last year
- A pipeline focused on the in-painting of text in images. For example the removal of subtitles in a screenshot of a movie.☆16Updated 3 years ago
- RAG-QA is a free, containerised question-answer framework that allows you to ask questions to your documents in an intuitive way☆19Updated last year