NTUYWANG103 / clip-image-search
This code implements a versatile image search engine leveraging the CLIP model and FAISS, capable of processing both text-to-image and image-to-image queries.
☆37Updated 8 months ago
Related projects: ⓘ
- Chinese CLIP models with SOTA performance.☆44Updated last year
- Image Editing Anything☆112Updated last year
- Chinese Stable Diffusion, zh SD,中文文生图,中文SD,中文Stable Diffusion☆43Updated 6 months ago
- Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…☆36Updated 8 months ago
- AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection - CVPR NAS 2023☆98Updated last year
- ☆143Updated 3 weeks ago
- Incredibly descriptive audiovisual summaries for videos☆39Updated last month
- 这是一个基于stable diffusion的扩展绘画工具(outpainting)☆32Updated last year
- A simple image search engine using CLIP feature.☆52Updated last year
- A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it☆130Updated 7 months ago
- demo natural language video db using CLIP☆19Updated last month
- ☆29Updated 3 months ago
- ☆53Updated 7 months ago
- Diffusion WebUI: Stable Diffusion + ControlNet + Inpaint☆52Updated last year
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆124Updated 3 months ago
- TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation☆48Updated 3 months ago
- CLIP中文encoder☆21Updated 2 years ago
- official code for paper: Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark☆26Updated 8 months ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆35Updated last week
- ☆155Updated 2 months ago
- Precision Search through Multi-Style Inputs☆45Updated last month
- The official code of "RWKV-CLIP: A Robust Vision-Language Representation Learner"☆97Updated 2 months ago
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆45Updated 4 months ago
- Make HD zoom video of stable diffusion outpainting☆30Updated last year
- ☆161Updated 2 months ago
- AI toolbox and pretrain models.☆36Updated 7 months ago
- ☆54Updated 3 weeks ago
- ONNX-Powered Inference for State-of-the-Art Face Upscalers☆74Updated last month
- ☆63Updated last year
- Offical Code for GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation☆128Updated 9 months ago