Implementation of seq2seq model for Visual Storytelling Challenge (VIST) http://visionandlanguage.net/VIST/index.html
☆63Aug 17, 2018Updated 7 years ago
Alternatives and similar repositories for ai-visual-storytelling-seq2seq
Users who are interested in ai-visual-storytelling-seq2seq are comparing it to the repositories listed below
- GLAC Net: GLocal Attention Cascading Network for the Visual Storytelling Challenge☆45Aug 26, 2020Updated 5 years ago
- VIST storytelling evaluation tool☆21Dec 5, 2023Updated 2 years ago
- Visual Storytelling API☆36Feb 11, 2017Updated 9 years ago
- Visual Storytelling post-edit dataset☆18Sep 27, 2019Updated 6 years ago
- Aligntune: A Modular Toolkit for Post Training Alignment of LLMs☆35Feb 26, 2026Updated last week
- Codebase for VidHal: Benchmarking Hallucinations in Vision LLMs☆14Apr 19, 2025Updated 10 months ago
- Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)☆65Oct 19, 2020Updated 5 years ago
- Code for my Master's project on VideoBERT☆39Nov 25, 2020Updated 5 years ago
- ☆17Jan 24, 2021Updated 5 years ago
- [EMNLP 2021] Code and data for our paper "Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers…☆20Jan 17, 2022Updated 4 years ago
- [IJCAI 2018] Deep Reasoning with Knowledge Graph for Social Relationship Understanding.☆21May 23, 2022Updated 3 years ago
- Automated Storytelling via Causal, Commonsense Plot Ordering☆53Sep 3, 2020Updated 5 years ago
- AMR-to-text Generation with Graph Transformer☆18Nov 16, 2020Updated 5 years ago
- 8th place solution to the Kaggle Quick, Draw! Doodle Recognition Competition☆22Dec 10, 2018Updated 7 years ago
- Code associated with the "Natural Language Rationales with Full-Stack Visual Reasoning" EMNLP Findings 2020 paper☆24Jan 15, 2021Updated 5 years ago
- CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022☆29Dec 1, 2022Updated 3 years ago
- Bayesian Optimization Executable and Visualizable Application☆10Aug 14, 2023Updated 2 years ago
- Codes of AAAI 2020 paper "What Makes A Good Story? Designing Composite Rewards for Visual Storytelling"☆27May 31, 2021Updated 4 years ago
- Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)☆134Jul 25, 2024Updated last year
- automated insights for tabular data☆10Feb 10, 2025Updated last year
- Neural State Machine implemented in PyTorch☆71Oct 10, 2019Updated 6 years ago
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Apr 13, 2025Updated 10 months ago
- Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval☆68Apr 10, 2020Updated 5 years ago
- Code for a web demo of Plan, Write, and Revise: a neural system for interactive open-domain story generation☆34Oct 25, 2021Updated 4 years ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- Creating crowdsourcing based experiments made easy☆10May 25, 2020Updated 5 years ago
- Partially Non-Autoregressive Image Captioning☆10Sep 30, 2021Updated 4 years ago
- Implementation of NAACL'19 Strong and Simple Baselines for Multimodal Utterance Embeddings☆10Jun 4, 2019Updated 6 years ago
- Keyword extraction using Scake, KeyBERT, Fine-tuning Transformer BERT-like models and ChatGPT.☆12May 22, 2023Updated 2 years ago
- end-to-end information extraction pipeline built by LayoutLMV2, pretrained model from HuggingFace☆11Aug 15, 2023Updated 2 years ago
- Notes on "Data Science from Scratch" by Joel Grus☆11Aug 9, 2016Updated 9 years ago
- Newspaper Segmentation into images and text☆12Jan 11, 2019Updated 7 years ago
- Generalization in Metric Learning: Should the Embedding Layer be the Embedding Layer?☆11Jan 3, 2019Updated 7 years ago
- Simple model for sentence compression (a.k.a Baseline in Klerke et al., NAACL 2016)☆10Dec 16, 2018Updated 7 years ago
- Implementation of CVPR 2016 paper☆74Jan 31, 2021Updated 5 years ago
- (ACL 2019) "Automatic Generation of Personalized Comment Based on User Profile"☆39Mar 24, 2023Updated 2 years ago
- Official Code for Towards Transparent and Explainable Attention Models paper (ACL 2020)☆35Jun 22, 2022Updated 3 years ago
- Using LLMs and pre-trained caption models for super-human performance on image captioning.☆42Oct 13, 2023Updated 2 years ago
- Microsoft COCO Caption Evaluation Tool - Python 3☆33May 23, 2019Updated 6 years ago