eren23 / sam-clip-diffusion
SAM + CLIP + Diffusion for editing objects in images using plain text
☆15 · Updated 2 years ago
Alternatives and similar repositories for sam-clip-diffusion
Users who are interested in sam-clip-diffusion compare it to the libraries listed below.
- This project is under development. ☆23 · Updated 2 years ago
- ☆11 · Updated 4 years ago
- ☆15 · Updated 2 years ago
- This is a streamlit web interface for the Segment Anything. ☆23 · Updated 2 years ago
- Unofficial pytorch implementation of TryOnGAN ☆18 · Updated 3 years ago
- We archive data because we are interested in the diffs. All data is from https://video-api.cartoonnetwork.com. We run the check every min… ☆10 · Updated this week
- Supporting code for: Video Enriched Retrieval Augmented Generation Using Aligned Video Captions ☆30 · Updated last year
- A pipeline to generate user-preferred photo-realistic avatars using stable-diffusion and bayesian-optimization. ☆18 · Updated 5 months ago
- Visual similarity search engine demo with use of PyTorch Metric Learning and Qdrant ☆12 · Updated 2 years ago
- Google's Gemini implemented with GPT-4 Vision, Whisper and Resemble AI ☆26 · Updated last year
- LoRA fine-tuned Stable Diffusion Deployment ☆31 · Updated 2 years ago
- Diffusion WebUI: Stable Diffusion + ControlNet + Inpaint ☆52 · Updated 2 years ago
- Web UI for Stable Diffusion prompt generation via GPT-2 trained model ☆51 · Updated 2 years ago
- Modify-Anything is based on yolov5 and yolov8 for video and image detection. Segment-anything and lama_cleaner are applied to segment, modify, era… ☆16 · Updated 2 years ago
- A pipeline focused on the in-painting of text in images. For example, the removal of subtitles in a screenshot of a movie. ☆15 · Updated 3 years ago
- Style transfer of a face into a cartoon without a GAN. A UNet++ network with MobileNet v3 backbone optimized for mobile frameworks ☆30 · Updated 3 years ago
- A Python neural network made with TensorFlow that converts one person's voice into another. ☆10 · Updated 4 years ago
- Automatically generate a lip-synced avatar based off of a transcript and audio ☆13 · Updated 2 years ago
- Photorealism model using RealVisXL v4.0 ☆12 · Updated last year
- Experiment on QnA tabular data using LLMs and SQL ☆28 · Updated 11 months ago
- RAG-QA is a free, containerised question-answer framework that allows you to ask questions to your documents in an intuitive way ☆17 · Updated last year
- Fine-tuning OpenAI CLIP Model for Image Search on medical images ☆76 · Updated 3 years ago
- Minimal zero-shot intent classifier for arbitrary intent slot filling, via LLM prompting with LangChain. ☆35 · Updated 2 years ago
- Simple image classification for custom dataset (pytorch-lightning, timm) ☆28 · Updated 3 years ago
- A Simple Image Clustering Script using CLIP and Hierarchical Clustering ☆38 · Updated 2 years ago
- Monetize.ai is a web-based chatbot that provides personalized investment advice using GPT-3.5 and Yahoo Finance API. It's built using Fla… ☆16 · Updated 2 years ago
- Demo example of consumer goods categorization ☆28 · Updated last year
- Prompts and evaluation data for LLMs on real world coding and writing tasks ☆15 · Updated last month
- ☆50 · Updated 3 years ago
- ☆69 · Updated 6 months ago