NTUYWANG103 / clip-image-searchLinks
This code implements a versatile image search engine leveraging the CLIP model and FAISS, capable of processing both text-to-image and image-to-image queries.
☆49Updated last year
Alternatives and similar repositories for clip-image-search
Users that are interested in clip-image-search are comparing it to the libraries listed below
Sorting:
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆144Updated 11 months ago
- Chinese Stable Diffusion, zh SD,中文文生图,中文SD,中文Stable Diffusion☆49Updated last year
- Codebase for the Recognize Anything Model (RAM)☆88Updated 2 years ago
- ☆72Updated 2 years ago
- Image Editing Anything☆116Updated 2 years ago
- official code for paper: Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark☆40Updated 2 years ago
- A simple image search engine using CLIP feature.☆74Updated 2 years ago
- [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces☆236Updated 10 months ago
- ☆185Updated 5 months ago
- Chinese CLIP models with SOTA performance.☆60Updated 2 years ago
- AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection - CVPR NAS 2023☆207Updated 2 years ago
- [TIP 2025] CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models 🔥☆221Updated 8 months ago
- The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".☆253Updated last year
- Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…☆52Updated 2 years ago
- [ECCV 2022] AutoTransition: Learning to Recommend Video Transition Effects☆65Updated 10 months ago
- ☆201Updated last year
- Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models☆313Updated 2 years ago
- [ICCV 2023] Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation☆287Updated 10 months ago
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101Updated last year
- Generate image from anything with ImageBind and Stable Diffusion☆201Updated 2 years ago
- Search images with a text or image query, using Open AI's pretrained CLIP model.☆262Updated 3 years ago
- ☆21Updated 3 years ago
- Fine-tuning code for CLIP models☆262Updated 5 months ago
- Precision Search through Multi-Style Inputs☆73Updated 5 months ago
- Grounding DINO with Segment Anything & Stable Diffusion colab☆195Updated 2 years ago
- Model for watermark classification implemented with PyTorch☆121Updated last year
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models☆85Updated last year
- A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it☆143Updated last year
- [AAAI2025] DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework☆363Updated last year
- [Open-Source Project] Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instance…☆576Updated last year