NTUYWANG103 / clip-image-searchLinks
This code implements a versatile image search engine leveraging the CLIP model and FAISS, capable of processing both text-to-image and image-to-image queries.
☆48Updated last year
Alternatives and similar repositories for clip-image-search
Users that are interested in clip-image-search are comparing it to the libraries listed below
Sorting:
- A simple image search engine using CLIP feature.☆69Updated 2 years ago
- Chinese Stable Diffusion, zh SD,中文文生图,中文SD,中文Stable Diffusion☆49Updated last year
- ☆181Updated last week
- Image Editing Anything☆116Updated 2 years ago
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆137Updated 6 months ago
- Codebase for the Recognize Anything Model (RAM)☆82Updated last year
- Generate image from anything with ImageBind and Stable Diffusion☆196Updated 2 years ago
- Chinese CLIP models with SOTA performance.☆56Updated last year
- A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it☆139Updated last year
- official code for paper: Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark☆38Updated last year
- Precision Search through Multi-Style Inputs☆71Updated last week
- Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models☆313Updated last year
- ☆187Updated last year
- DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework☆348Updated 8 months ago
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort☆152Updated 8 months ago
- Grounding DINO with Segment Anything & Stable Diffusion colab☆197Updated last year
- Offical Code for GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation☆141Updated 9 months ago
- [TIP 2025] CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models 🔥☆219Updated 3 months ago
- ☆70Updated 2 years ago
- Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation"☆100Updated 2 years ago
- A simple Segment Anything WebUI based on Gradio.☆81Updated 2 years ago
- Diffusers training with mmengine☆102Updated last year
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models☆81Updated last year
- Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions (NeurIPS 2024)☆164Updated last year
- A cli program of image retrieval using dinov2☆75Updated 2 years ago
- [CVPR2024] Make Your Dream A Vlog☆426Updated 2 months ago
- [Open-Source Project] Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instance…☆570Updated last year
- Segment Anything combined with CLIP☆345Updated last year
- [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces☆238Updated 5 months ago
- Research Code for Multimodal-Cognition Team in Ant Group☆161Updated last month