This is the official code of the paper "Differentiable Cross Modal Hashing via Multimodal Transformers"
☆18Mar 11, 2024Updated last year
Alternatives and similar repositories for DCHMT
Users that are interested in DCHMT are comparing it to the libraries listed below
Sorting:
- Source code for TCSVT paper “Deep Semantic-Aware Proxy Hashing for Multi-Label Cross-Modal Retrieval”☆18Nov 30, 2025Updated 3 months ago
- ☆12Sep 8, 2022Updated 3 years ago
- ☆21Apr 10, 2024Updated last year
- This project summarizes the CLIP-based cross-modal hashing methods. Including DCMHT, MITH, DSPH, DNPH, TwDH (Two-Step Discrete Hashing fo…☆48Sep 15, 2025Updated 5 months ago
- ☆74May 26, 2025Updated 9 months ago
- The source code of "Bit-aware Semantic Transformer Hashing for Multi-modal Retrieval." (Accepted by SIGIR 2022)☆18Sep 15, 2022Updated 3 years ago
- CLIP-based Fusion-modal Reconstructing Hashing for Unsupervised Large-scale Cross-modal Retrieval☆13Aug 7, 2023Updated 2 years ago
- Cross-Modal-Hashing-Retrieval/Multi-Modal-Hashing-Retrieval☆23Jun 20, 2023Updated 2 years ago
- High-order nonlocal Hashing for unsupervised cross-modal retrieval☆14Nov 11, 2023Updated 2 years ago
- Cross-Modal-Real-valuded-Retrieval☆86Jul 18, 2023Updated 2 years ago
- [CVPR'25] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering☆46Jun 19, 2025Updated 8 months ago
- 这是一个用于计算ViT及其变种模型的GradCAM自动脚本,可以自动处理批量的图像 A GradCAM automatic script to visualize the model result☆18Dec 16, 2024Updated last year
- This repository contains the author's implementation in PyTorch for the paper "Adaptive Label-aware Graph Convolutional Networks for Cros…☆15Dec 6, 2021Updated 4 years ago
- [IEEE TMM'25] Scene-Text Grounding for Text-Based Video Question Answering☆16Feb 16, 2026Updated 3 weeks ago
- 这个项目是基于python3的mxnet框架实现的实时视频人脸识别,其中包括视频传输,人脸识别等部分,用户可根据需要调整使用。整个项目建立在ubuntu18.04系统下。☆16Dec 12, 2020Updated 5 years ago
- The source code of "Teacher-Student Learning: Efficient Hierarchical Message Aggregation Hashing for Cross-Modal Retrieval." (Accepted by…☆19Jun 7, 2022Updated 3 years ago
- Source code for ICMR'19 paper "Triplet Fusion Network Hashing for Unpaired Cross-Modal Retrieval"☆18Mar 22, 2025Updated 11 months ago
- Source code for paper "Supervised Discrete Hashing" on CVPR-2015☆21Jan 1, 2020Updated 6 years ago
- Source code for paper "Asymmetric Deep Supervised Hashing" on AAAI-2018☆20Oct 29, 2019Updated 6 years ago
- Implementation of Weakly Supervised Deep Image Hashing through Tag Embeddings☆25Jun 22, 2022Updated 3 years ago
- Implementation of NIPS2023: Unleashing the Full Potential of Product Quantization for Large-Scale Image Retrieva☆11Nov 12, 2024Updated last year
- source code for "Deep adversarial discrete hashing for cross-modal retrieval"☆25Jul 6, 2023Updated 2 years ago
- The official implementation for BLIP4CIR with bi-directional training | Bi-directional Training for Composed Image Retrieval via Text Pro…☆34Feb 7, 2024Updated 2 years ago
- Implementation of accepted AAAI 2021 paper: Deep Unsupervised Image Hashing by Maximizing Bit Entropy☆84Dec 2, 2021Updated 4 years ago
- The code for the paper "Contrastive Quantization with Code Memory for Unsupervised Image Retrieval" (AAAI'22, Oral).☆38Oct 21, 2022Updated 3 years ago
- CLIP-based Adaptive Graph Attention Network for Large-Scale Unsupervised Multi-modal Hashing Retrieval☆10Mar 18, 2024Updated last year
- Awesome free AI courses available on YouTube☆12Apr 16, 2025Updated 10 months ago
- Scalable deep multimodal learning for cross-modal retrieval (SIGIR 2019, PyTorch Code)☆34Jul 24, 2020Updated 5 years ago
- Project Page for CoPRS, offering training overview, inference code, and downloadable links.☆20Oct 27, 2025Updated 4 months ago
- ☆42Mar 23, 2022Updated 3 years ago
- Source Code for Online Collective Matrix Factorization Hashing. Reference: Di Wang, Quan Wang, Yaqiang An, Xinbo Gao, and Yumin Tian. 202…☆11Oct 20, 2020Updated 5 years ago
- ☆11Jul 17, 2024Updated last year
- Code for the paper: Graph Jigsaw Learning for Cartoon Face Recognition☆10Jul 1, 2022Updated 3 years ago
- ☆10Sep 7, 2022Updated 3 years ago
- A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval☆43Apr 13, 2022Updated 3 years ago
- The code for the paper "Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning" (CVPR'25).☆15Sep 25, 2025Updated 5 months ago
- Deep Graph-neighbor Coherence Preserving Network for Unsupervised Cross-modal Hashing