Code and Resources for the Transformer Encoder Reasoning Network (TERN) - https://arxiv.org/abs/2004.09144
☆58Dec 6, 2023Updated 2 years ago
Alternatives and similar repositories for TERN
Users that are interested in TERN are comparing it to the libraries listed below.
- Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019☆41Sep 24, 2019Updated 6 years ago
- WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching☆16Dec 10, 2021Updated 4 years ago
- Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.☆74Oct 25, 2022Updated 3 years ago
- The official code for paper "Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking", ACM Multimedia 2019 Oral☆68Sep 28, 2019Updated 6 years ago
- [AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”☆219Apr 11, 2024Updated 2 years ago
- Detect adversarial images from intermediate features in distance space☆12Aug 22, 2018Updated 7 years ago
- code for our CVPR2020 paper "IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval"☆96Mar 8, 2020Updated 6 years ago
- Code and Resources for our paper "Virtual to Real adaptation of Pedestrian Detectors" - https://www.mdpi.com/1424-8220/20/18/5250☆11Jul 25, 2024Updated last year
- Relational Content-Based Image Retrieval (R-CBIR) - Retrieving images with given relationships among objects☆17Oct 12, 2021Updated 4 years ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020☆82Jul 17, 2020Updated 5 years ago
- The official source code for the paper Consensus-Aware Visual-Semantic Embedding for Image-Text Matching (ECCV 2020)☆168Feb 7, 2022Updated 4 years ago
- The research tools developed for HoloLens2☆10Dec 2, 2022Updated 3 years ago
- Official implementation of the paper "ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval"☆28Dec 6, 2023Updated 2 years ago
- CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval☆127Feb 26, 2020Updated 6 years ago
- PyTorch code for ICCV'19 paper "Visual Semantic Reasoning for Image-Text Matching"☆304Jan 14, 2020Updated 6 years ago
- [AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs".☆38Oct 4, 2023Updated 2 years ago
- Position Focused Attention Network for Image-Text Matching☆69Aug 20, 2019Updated 6 years ago
- Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching☆170Oct 12, 2020Updated 5 years ago
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- Implementation of our ACMMM2019 paper, Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching☆39Jun 19, 2023Updated 2 years ago
- Cross-modal Coherence Modeling for Caption Generation☆11Jul 24, 2020Updated 5 years ago
- PyTorch implementation of the AAAI-21 paper "Dual Adversarial Label-aware Graph Neural Networks for Cross-modal Retrieval" and the TPAMI-…☆41Nov 1, 2022Updated 3 years ago
- Paper reading notes in the field of Image-Text Matching/Retrieval.☆13Mar 25, 2022Updated 4 years ago
- Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)☆165Aug 24, 2025Updated 8 months ago
- A Computer Vision Approach for Pass Detection on Soccer Broadcast Video☆21May 8, 2023Updated 3 years ago
- PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"☆521Dec 8, 2021Updated 4 years ago
- Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval☆68Apr 10, 2020Updated 6 years ago
- Pytorch implementation of Hebbian learning algorithms to train deep convolutional neural networks.☆27Jul 2, 2024Updated last year
- AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"☆58Oct 25, 2021Updated 4 years ago
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)☆20Dec 4, 2021Updated 4 years ago
- Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.☆36Jun 16, 2023Updated 2 years ago
- The code of the paper "Cross-Modal Graph Matching Network for Image-Text Retrieval" in ACM Transactions on Multimedia Computing, Communic…☆45Jun 5, 2023Updated 2 years ago
- ☆131Dec 10, 2022Updated 3 years ago
- PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)☆580May 18, 2023Updated 3 years ago
- Pytorch implementation of Hebbian learning algorithms to train deep convolutional neural networks.☆31Aug 30, 2021Updated 4 years ago
- "Can images help recognize entities? A study of the role of images for Multimodal NER" (W-NUT at EMNLP 2021)☆21Nov 14, 2021Updated 4 years ago
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning☆12Aug 23, 2025Updated 8 months ago
- Code to reproduce experiments in 'LSTM-based real-time action detection and prediction in human motion streams'☆29Aug 31, 2022Updated 3 years ago
- The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretr…☆445Sep 25, 2025Updated 7 months ago