fortunechen / paper-reading_CrossModelGroup-USTC
USTC Cross-Modal Intelligence Group: weekly paper sharing
☆16Nov 20, 2022Updated 3 years ago
Alternatives and similar repositories for paper-reading_CrossModelGroup-USTC
Users that are interested in paper-reading_CrossModelGroup-USTC are comparing it to the libraries listed below
- Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.☆24Aug 5, 2023Updated 2 years ago
- CVPR 2025 Accepted Papers☆23Dec 20, 2025Updated last month
- Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.☆119Jun 19, 2023Updated 2 years ago
- ☆18Mar 21, 2025Updated 10 months ago
- ☆13Feb 1, 2022Updated 4 years ago
- Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.☆36Jun 16, 2023Updated 2 years ago
- Pytorch implementation for Lightweight Generative Adversarial Networks for Text-Guided Image Manipulation.☆18Jan 4, 2022Updated 4 years ago
- Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching☆170Oct 12, 2020Updated 5 years ago
- Implementation of our ACMMM2019 paper, Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching☆39Jun 19, 2023Updated 2 years ago
- RSTPReid Dataset for Text-based Person Retrieval.☆32Sep 2, 2022Updated 3 years ago
- ☆35Nov 3, 2022Updated 3 years ago
- [BMVC 2021] Text-Based Person Search with Limited Data☆47Aug 12, 2022Updated 3 years ago
- code base for vision transformers☆36Dec 4, 2021Updated 4 years ago
- ☆35May 4, 2021Updated 4 years ago
- [AAAI2024] An official PyTorch implementation of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst…☆13Dec 8, 2024Updated last year
- ☆11Aug 20, 2025Updated 5 months ago
- Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019☆41Sep 24, 2019Updated 6 years ago
- Official repository for Scone (Subject-driven Composition and Distinction Enhancement) model, designed to support multi-subject compositi…☆28Jan 14, 2026Updated last month
- [CVPR 2022] Sequential Voting with Relational Box Fields for Active Object Detection☆10Jun 19, 2022Updated 3 years ago
- CLIP-based Fusion-modal Reconstructing Hashing for Unsupervised Large-scale Cross-modal Retrieval☆13Aug 7, 2023Updated 2 years ago
- Calculation of the entropy of the batch of images (whole image or patches)☆10Oct 15, 2021Updated 4 years ago
- Generative label fused network for image–text matching☆10Jan 13, 2023Updated 3 years ago
- This is the code for Few-Shot Joint Multimodal Entity-Relation Extraction via Knowledge-Enhanced Cross-modal Prompt Model (ACM MM 2024)☆12Aug 27, 2024Updated last year
- Dual-path CNN with Max Gated block for Text-Based Person Re-identification☆10Dec 5, 2020Updated 5 years ago
- Official implementation of ICCV 2025 paper - DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization☆22Jul 13, 2025Updated 7 months ago
- ☆45Dec 26, 2021Updated 4 years ago
- In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d. First, we start from a population model which ha…☆15Jan 16, 2025Updated last year
- ☆12May 19, 2025Updated 8 months ago
- Code of the paper "LGI-GT: Graph Transformers with Local and Global Operators Interleaving"☆12Sep 4, 2023Updated 2 years ago
- ☆12May 7, 2018Updated 7 years ago
- ☆12Oct 21, 2019Updated 6 years ago
- The Project of Our ICCV Paper☆10Nov 10, 2020Updated 5 years ago
- ☆14Mar 11, 2025Updated 11 months ago
- A pytorch implementation of our paper Image Captioning with Inherent Sentiment (ICME 2021 Oral).☆11Jul 18, 2022Updated 3 years ago
- Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat…☆13Apr 11, 2023Updated 2 years ago
- An Android live wallpaper which plays the brick-busting game of breakout around your icons. Available on the Android Market.☆21Oct 16, 2010Updated 15 years ago
- Cross-modal Coherence Modeling for Caption Generation☆11Jul 24, 2020Updated 5 years ago
- [EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information☆12Oct 11, 2024Updated last year
- [AAAI 2024] MESED: A Multi-modal Entity Set Expansion Dataset with Fine-grained Semantic Classes and Hard Negative Entities☆16Apr 26, 2024Updated last year