ChenAnno / Real20M_ACMMM2023View external linksLinks
Official implementation for "Real20M: A Large-scale E-commerce Dataset for Cross-domain Retrieval"
☆25Oct 27, 2025Updated 3 months ago
Alternatives and similar repositories for Real20M_ACMMM2023
Users that are interested in Real20M_ACMMM2023 are comparing it to the libraries listed below
Sorting:
- Official implementation for "SPIRIT: Style-guided Patch Interaction for Fashion Image Retrieval with Text Feedback"☆16Oct 27, 2025Updated 3 months ago
- Official implementation for "FashionERN: Enhance-and-Refine Network for Composed Fashion Image Retrieval"☆19Oct 27, 2025Updated 3 months ago
- Lion: Kindling Vision Intelligence within Large Language Models☆51Jan 25, 2024Updated 2 years ago
- ☆17Mar 5, 2025Updated 11 months ago
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]☆56May 27, 2025Updated 8 months ago
- MXNet-Gluon model to Caffe (support SSD in gluoncv)☆10Jun 20, 2019Updated 6 years ago
- ☆10Oct 25, 2024Updated last year
- NightSurveillance Sataset for Pedestrian Detection☆11Jul 30, 2020Updated 5 years ago
- ☆11May 17, 2024Updated last year
- Black-box Few-shot Knowledge Distillation☆13Jul 19, 2022Updated 3 years ago
- A digital twin of the city of Chicago along with automated sensors☆12Nov 14, 2019Updated 6 years ago
- ☆24Jun 12, 2025Updated 8 months ago
- ☆12Jan 10, 2025Updated last year
- Automated neural architecture search algorithms implemented in PyTorch and Autogluon toolkit.☆12Apr 17, 2020Updated 5 years ago
- MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer☆50Sep 6, 2024Updated last year
- [CVPR 2023 (Highlight)] FAME-ViL: Multi-Tasking V+L Model for Heterogeneous Fashion Tasks☆56Sep 28, 2023Updated 2 years ago
- ☆12Feb 2, 2023Updated 3 years ago
- Repository for the paper I See You: A Vehicle-Pedestrian Interaction Dataset from Traffic Surveillance Cameras, presented at the LXAI wor…☆19Jun 9, 2025Updated 8 months ago
- A curated list of resources dedicated to computer vision and related algorithms for creating, correcting maps. Feel free to make PRs to c…☆13Jan 3, 2019Updated 7 years ago
- ☆15Mar 30, 2025Updated 10 months ago
- Data-Efficient Multimodal Fusion on a Single GPU☆68May 7, 2024Updated last year
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- The application of large pre-trained vision model DINOv2 from MetaAI for feature points matching, and a ViT decoder used for Auto Encoder☆17Apr 27, 2023Updated 2 years ago
- Hierarchical And Quantized AutoEncoders☆13Jun 12, 2020Updated 5 years ago
- [AAAI 2025] Official code for "OmniCount: Multi-label Object Counting with Semantic-Geometric Priors"☆21Sep 30, 2025Updated 4 months ago
- ☆64Feb 1, 2026Updated last week
- Code for paper: "Region Proposals for Saliency Map Refinement for Weakly-supervised Disease Localisation and Classification"☆14Jun 29, 2021Updated 4 years ago
- ☆22Sep 9, 2025Updated 5 months ago
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"☆20Oct 17, 2024Updated last year
- In this work, we implement different cross-modal learning schemes such as Siamese Network, Correlational Network and Deep Cross-Modal Pro…☆11Aug 23, 2021Updated 4 years ago
- Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21☆70May 26, 2022Updated 3 years ago
- ☆14Oct 14, 2021Updated 4 years ago
- FedCMR: Federated Cross-Modal Retrieval 的代码(the official implementation of FedCMR: Federated Cross-Modal Retrieval)☆17Oct 17, 2025Updated 3 months ago
- ☆18Aug 23, 2022Updated 3 years ago
- Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MIC…☆17Feb 12, 2025Updated last year
- MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning☆39Dec 30, 2025Updated last month
- [ECCV 2024 Workshop🎈] The first agriculture benchmark to evaluate MM-LLMs.☆23Jan 1, 2025Updated last year
- [AAAI-2024] Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception, Xiao Wang, Wentao Wu, Chenglong Li, Zhi…☆27Jul 29, 2024Updated last year
- The official GitHub repo for the paper MOAT: Evaluating LMMs for Capability Integration and Instruction Grounding.☆22Dec 29, 2025Updated last month