Repo for "Synergy of Sight and Semantics: Visual Intention Understanding with CLIP"
☆12Mar 12, 2025Updated 11 months ago
Alternatives and similar repositories for IntCLIP
Users that are interested in IntCLIP are comparing it to the libraries listed below
Sorting:
- Repo for "Uncertain Multimodal Intention and Emotion Understanding in the Wild"☆16Oct 20, 2025Updated 4 months ago
- EmoLLM: Multimodal Emotional Understanding Meets Large Language Models☆19Jun 24, 2024Updated last year
- Code release for TexFit: Text-Driven Fashion Image Editing with Diffusion Models (AAAI 2024)☆29Sep 30, 2024Updated last year
- A prototype of information retrieval system inclduing the document parsing, index construction, query parsing to support the vector space…☆15May 16, 2023Updated 2 years ago
- ☆14Jan 9, 2025Updated last year
- ☆10Nov 12, 2024Updated last year
- 📚 LaTeX templates and tools for creating beautiful, structured documents 📝☆14Oct 24, 2025Updated 4 months ago
- Deepseek-CoT☆10Oct 6, 2024Updated last year
- This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm…☆12Oct 9, 2024Updated last year
- Pytorch implementation of 'Improving Self-supervised Lightweight Model Learning via Hard-aware Metric Distillation. In ECCV 2022'☆12Mar 22, 2023Updated 2 years ago
- Official implementation of "Pan-Sharpening With Wavelet-Enhanced High-Frequency Information"☆13Mar 28, 2024Updated last year
- ☆10Oct 4, 2023Updated 2 years ago
- ☆11Oct 18, 2022Updated 3 years ago
- Various object detection testing using YOLO and other algorithms, Raspberry pi based integration experiments.☆12Dec 9, 2024Updated last year
- ☆12Nov 11, 2024Updated last year
- paper code commit-fsmafl☆10Mar 18, 2024Updated last year
- It's a project of medical image processing.☆13Oct 16, 2022Updated 3 years ago
- ☆13Aug 14, 2022Updated 3 years ago
- ☆13Oct 25, 2024Updated last year
- [CVPR 2025] An Implementation of the paper "Pre-Instruction Data Selection for Visual Instruction Tuning"☆17Jun 9, 2025Updated 8 months ago
- Cross-Modality Fusion Mechanism for Multispectral Object Detection☆13Oct 11, 2022Updated 3 years ago
- Code of Paper OmniFuse: Composite Degradation-Robust Image Fusion with Language-Driven Semantics.☆28Sep 16, 2025Updated 5 months ago
- ☆11Aug 1, 2024Updated last year
- (2025' IJCV) This is the offical implementation for the paper titled "FusionBooster: A Unified Image Fusion Boosting Paradigm".☆14Jul 23, 2025Updated 7 months ago
- Smooth Variational Graph Embeddings for Efficient Neural Architecture Search☆14Feb 2, 2023Updated 3 years ago
- 🚀 A modern,lightly and intelligent macOS launcher. 一个现代的、设计简洁且智能的 macOS 启动器☆43Updated this week
- [ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP☆16Apr 17, 2025Updated 10 months ago
- ☆11Apr 27, 2022Updated 3 years ago
- Run SOTA Vision-Language Model Florence-2 on your data!☆15Mar 27, 2025Updated 11 months ago
- ☆17Dec 11, 2023Updated 2 years ago
- This repo contains the data used in "Towards Understanding Climate Change Perceptions: A Social Media Dataset"☆14Aug 30, 2024Updated last year
- This is the official repository of our NeurIPS 2025 paper "MaxSup: Overcoming Representation Collapse in Label Smoothing"☆21Nov 6, 2025Updated 3 months ago
- ☆13Dec 6, 2024Updated last year
- ☆15May 21, 2024Updated last year
- ☆52Dec 31, 2024Updated last year
- Source code for UP-Diff☆14Nov 26, 2024Updated last year
- This is the official implementation of our BMVC 2022 paper "SP-ViT: Learning 2D Spatial Priors for Vision Transformers"☆13Mar 27, 2023Updated 2 years ago
- Smooth Variational Graph Embeddings for Efficient Neural Architecture Search☆15Apr 8, 2024Updated last year
- CSANet: Cross-Temporal Interaction Symmetric Attention Network for Hyperspectral Image Change Detection☆12Sep 13, 2022Updated 3 years ago