☆11 · Dec 8, 2022 · Updated 3 years ago
Alternatives and similar repositories for tapm
Users that are interested in tapm are comparing it to the libraries listed below
- Implementation of "Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling" ☆12 · Aug 19, 2021 · Updated 4 years ago
- Official Github repo of the VIST Challenge NAACL 2018 ☆17 · Aug 3, 2018 · Updated 7 years ago
- Code repository for our EMNLP 2020 long paper "Modeling Protagonist Emotions for Emotion-Aware Storytelling" (https://arxiv.org/abs/2010.… ☆20 · Feb 8, 2021 · Updated 5 years ago
- Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021 ☆66 · Oct 21, 2021 · Updated 4 years ago
- L-Verse: Bidirectional Generation Between Image and Text ☆107 · Apr 1, 2025 · Updated 11 months ago
- Visual Storytelling API ☆36 · Feb 11, 2017 · Updated 9 years ago
- Code for the ACL paper "No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling" ☆136 · Jan 19, 2021 · Updated 5 years ago
- Normalization Matters in Weakly Supervised Object Localization (ICCV 2021) ☆11 · Oct 24, 2021 · Updated 4 years ago
- Creating crowdsourcing based experiments made easy ☆10 · May 25, 2020 · Updated 5 years ago
- Code for "Time-Aware Auto White Balance in Mobile Photography" ☆28 · Jan 25, 2026 · Updated last month
- ☆11 · Dec 2, 2018 · Updated 7 years ago
- PyTorch implementation of Area Attention. ☆11 · Nov 30, 2020 · Updated 5 years ago
- Supervision by Fusion: Towards Unsupervised Learning of Deep Salient Object Detector ☆11 · Jun 24, 2023 · Updated 2 years ago
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions ☆10 · Jan 17, 2021 · Updated 5 years ago
- ☆11 · Feb 11, 2020 · Updated 6 years ago
- Code and data for experiments on semantic fragments ☆11 · Jun 23, 2022 · Updated 3 years ago
- [TPAMI'2023]Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling ☆11 · Jan 3, 2023 · Updated 3 years ago
- ☆13 · Nov 28, 2025 · Updated 3 months ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann… ☆14 · Nov 3, 2023 · Updated 2 years ago
- ☆11 · Aug 14, 2018 · Updated 7 years ago
- Measure the diversity of image descriptions, repository for our COLING 2018 paper. ☆13 · Dec 29, 2019 · Updated 6 years ago
- Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025 ☆16 · Jan 12, 2026 · Updated last month
- ☆47 · Apr 29, 2024 · Updated last year
- A pytorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning. ☆48 · Nov 15, 2021 · Updated 4 years ago
- ☆10 · Jul 28, 2022 · Updated 3 years ago
- One-Pixel Shortcut: on the Learning Preference of Deep Neural Networks (ICLR 2023 Spotlight) ☆14 · Sep 28, 2025 · Updated 5 months ago
- Benchmark for evaluating open-ended generation ☆50 · Nov 6, 2024 · Updated last year
- Placeholder for code of BSP. ☆11 · Aug 13, 2021 · Updated 4 years ago
- Create a Persona dataset from English-language Reddit movie-category comments ☆11 · Aug 6, 2021 · Updated 4 years ago
- Code for EMNLP2021 paper “Transductive Learning for Unsupervised Text Style Transfer” ☆12 · Sep 19, 2021 · Updated 4 years ago
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning ☆12 · Aug 23, 2025 · Updated 6 months ago
- [NeurIPS 2019] Drill-down: Interactive Retrieval of Complex Scenes using Natural Language Queries ☆12 · Apr 15, 2022 · Updated 3 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language". ☆18 · Sep 17, 2021 · Updated 4 years ago
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners" ☆13 · Dec 1, 2024 · Updated last year
- 2nd-place solution for the ProbSpace "Kuzushiji" (cursive Japanese script) recognition challenge ☆11 · Jun 17, 2019 · Updated 6 years ago
- ☆11 · Oct 4, 2023 · Updated 2 years ago
- [ICCV'23] UATVR: Uncertainty-Adaptive Text-Video Retrieval ☆13 · Nov 5, 2023 · Updated 2 years ago
- AutoGluon Docker ☆12 · Apr 17, 2020 · Updated 5 years ago
- Data and all ☆14 · Sep 30, 2019 · Updated 6 years ago