[ToMM2023] - AMC: Adaptive Multi-expert Collaborative Network for Text-guided Image Retrieval
☆20Aug 30, 2024Updated last year
Alternatives and similar repositories for AMC
Users that are interested in AMC are comparing it to the libraries listed below
Sorting:
- [TCSVT2023] - ESA: External Space Attention Aggregation for Image-Text Retrieval☆23Aug 30, 2024Updated last year
- [ICCV2023] - CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation☆38Oct 8, 2024Updated last year
- The official implementation for BLIP4CIR with bi-directional training | Bi-directional Training for Composed Image Retrieval via Text Pro…☆34Feb 7, 2024Updated 2 years ago
- C2P-CLIP-DeepfakeDetection☆93Dec 26, 2025Updated 2 months ago
- Source code for TCSVT paper “Deep Semantic-Aware Proxy Hashing for Multi-Label Cross-Modal Retrieval”☆18Nov 30, 2025Updated 3 months ago
- Multimodal-Composite-Editing-and-Retrieval-update☆35Oct 13, 2025Updated 4 months ago
- [ICCV 2023] The official PyTorch code for Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation☆90Sep 7, 2023Updated 2 years ago
- The official implementation for Candidate Set Re-ranking for Composed Image Retrieval (TMLR) 01/2024☆20Feb 7, 2024Updated 2 years ago
- 2019_操作系统实验_16281047☆11Jun 15, 2019Updated 6 years ago
- Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"☆10Nov 1, 2022Updated 3 years ago
- CLIP-based Adaptive Graph Attention Network for Large-Scale Unsupervised Multi-modal Hashing Retrieval☆10Mar 18, 2024Updated last year
- [SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval☆44Jul 14, 2024Updated last year
- ☆28Feb 2, 2026Updated 3 weeks ago
- implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering☆10Mar 17, 2022Updated 3 years ago
- [TPAMI'2023]Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling☆11Jan 3, 2023Updated 3 years ago
- Finetuning Stable Diffusion from Diffusers☆12Mar 11, 2024Updated last year
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 2 months ago
- Code and data for EMNLP2019 Paper "Uncover the Ground-Truth Relations in Distant Supervision: A Neural Expectation-Maximization Framework…☆10May 24, 2020Updated 5 years ago
- ☆294May 1, 2025Updated 10 months ago
- Bert Abstractive Summarization of Online News Discussion Threads☆13Dec 8, 2022Updated 3 years ago
- ☆17Dec 31, 2025Updated 2 months ago
- Beyond Words: A Multimodal Exploration of Persuasion in Memes☆13Jun 8, 2024Updated last year
- This is the code repo for our paper "Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression".☆12Feb 27, 2024Updated 2 years ago
- ☆13Jun 17, 2024Updated last year
- [CVPR 2025] Six-CD: Benchmarking Concept Removals for Benign Text-to-image Diffusion Models☆16Jan 8, 2026Updated last month
- Simple python interpreter☆13Jun 13, 2018Updated 7 years ago
- [ACM MM 2025 🔥🔥 ] MIRA: A first-of-its-kind medical RAG framework that fuses image features and retrieved knowledge with dynamic contex…☆18Aug 28, 2025Updated 6 months ago
- ☆11Nov 28, 2022Updated 3 years ago
- Code for the paper "Code Generation From Flowcharts with Texts: A Benchmark Dataset and An Approach"☆13Feb 11, 2023Updated 3 years ago
- Retrieval-augmented Image Captioning☆13Feb 16, 2023Updated 3 years ago
- ☆20Dec 3, 2025Updated 2 months ago
- The code for "Semi-Supervised Cross-Modal Hashing with Multi-view Graph Representation"☆10Apr 18, 2021Updated 4 years ago
- helper functions for processing and integrating visual language information with Qwen-VL Series Model☆17Aug 30, 2024Updated last year
- [NeurIPS 2024] SeeClear: This repo is the official implementation of "SeeClear: Semantic Distillation Enhances Pixel Condensation for Vid…☆18Oct 8, 2024Updated last year
- 🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer"☆13Feb 1, 2023Updated 3 years ago
- Fast division by invariant integers using multiplication☆13Jun 18, 2022Updated 3 years ago
- Build LLM Application with Local Documents☆19Jun 13, 2025Updated 8 months ago
- [AAAI 2024] XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning.☆15Jul 9, 2024Updated last year
- ☆17Feb 20, 2024Updated 2 years ago