xmu-xiaoma666 / SDATRView external linksLinks
Official Code for "Knowing what it is: Semantic-enhanced Dual Attention Transformer" (TMM2022)
☆19Oct 15, 2022Updated 3 years ago
Alternatives and similar repositories for SDATR
Users that are interested in SDATR are comparing it to the libraries listed below
Sorting:
- Towards Local Visual Modeling for Image Captioning☆29Mar 31, 2023Updated 2 years ago
- Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)☆123Dec 17, 2022Updated 3 years ago
- ☆24Apr 4, 2022Updated 3 years ago
- A paper list of image captioning.☆22Apr 23, 2022Updated 3 years ago
- ☆10Apr 20, 2018Updated 7 years ago
- Deliberate Attention Networks for Image Captioning (AAAI 2019)☆11Sep 30, 2019Updated 6 years ago
- CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022☆29Dec 1, 2022Updated 3 years ago
- Fine-Grained Visual Classification via Simultaneously Learning of Multi-regional Multi-grained Features☆12Mar 2, 2021Updated 4 years ago
- implement of Multi-Attention Multi-Class Constraint for Fine-grained Image Recognition with keras☆14Aug 3, 2020Updated 5 years ago
- CLIP4IDC: CLIP for Image Difference Captioning (AACL 2022)☆36Nov 12, 2022Updated 3 years ago
- Lightweight Transformer for Multi-modal Tasks☆16Dec 9, 2022Updated 3 years ago
- ☆19Jan 7, 2026Updated last month
- 2018阿里天池fashionAI服饰属性识别亚军☆17Sep 4, 2018Updated 7 years ago
- Weakly Supervised Grounding for VQA in Vision-Language Transformers☆16May 6, 2023Updated 2 years ago
- Image captioning with weight pruning in PyTorch☆22Jan 14, 2022Updated 4 years ago
- ☆20Nov 11, 2019Updated 6 years ago
- Look and Modify: Modification Networks for Image Captioning, BMVC 2019☆21Feb 18, 2020Updated 5 years ago
- Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.☆24Aug 5, 2023Updated 2 years ago
- Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).☆202Jun 8, 2022Updated 3 years ago
- A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning☆25Sep 4, 2020Updated 5 years ago
- A Out-of-box PyTorch Scaffold for Neural Network Quantization-Aware-Training (QAT) Research. Website: https://github.com/zhutmost/neuralz…☆25Dec 20, 2022Updated 3 years ago
- Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]☆273Jul 27, 2021Updated 4 years ago
- Exploiting Unlabeled Data for Target-Oriented Opinion Words Extraction☆24Sep 30, 2022Updated 3 years ago
- 整理cvpr论文,包括摘要,动机,架构,结果,总结☆27Dec 15, 2018Updated 7 years ago
- PyTorch implementation of Boosting Multi-Label Image Classification with Complementary Parallel Self-Distillation, IJCAI 2022.☆26Aug 25, 2022Updated 3 years ago
- Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features☆25Nov 15, 2021Updated 4 years ago
- ☆29Oct 19, 2022Updated 3 years ago
- 基于hrnet的backbone改进centernet☆23Aug 14, 2019Updated 6 years ago
- "Exploring Vision Transformers for Fine-grained Classification" at CVPRW FGVC8☆29Jun 25, 2021Updated 4 years ago
- Stack-Captioning: Coarse-to-Fine Learning for Image Captioning☆63Apr 18, 2018Updated 7 years ago
- Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval☆64Dec 1, 2022Updated 3 years ago
- Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for …☆61Oct 21, 2022Updated 3 years ago
- Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]☆69Jun 1, 2024Updated last year
- implementation of “Learning Spatial Regularization with Image-level Supervisions for Multi-label Image Classification” in Tensorflow☆26Sep 11, 2018Updated 7 years ago
- ☆38Feb 4, 2023Updated 3 years ago
- The implementation of multi-branch attentive Transformer (MAT).☆33Aug 27, 2020Updated 5 years ago
- A Lightweight Multi-modality Image Segmentation Network via Domain Adaptation using Gradient Magnitude and Shape Constraint☆10Apr 3, 2023Updated 2 years ago
- Official Code for All-in-One Medical Image Re-Identification (CVPR2025)☆18Jan 11, 2026Updated last month
- Official code for CVPR 2022 paper "Relieving Long-tailed Instance Segmentation via Pairwise Class Balance".☆37Apr 3, 2022Updated 3 years ago