[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning
☆104Jul 18, 2024Updated last year
Alternatives and similar repositories for VisualNews-Repository
Users who are interested in VisualNews-Repository are comparing it to the libraries listed below
- [EACL'23] COVID-VTS: Fact Extraction and Verification on Short Video Platforms☆11Sep 26, 2023Updated 2 years ago
- [ICPRAI 2024] DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents☆16Apr 4, 2024Updated last year
- Code for our CVPR'22 paper: Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources☆42Nov 15, 2022Updated 3 years ago
- [CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning☆93Apr 19, 2024Updated last year
- SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection☆80Aug 13, 2024Updated last year
- Good News Everyone! - CVPR 2019☆128Apr 14, 2022Updated 3 years ago
- Measure the diversity of image descriptions, repository for our COLING 2018 paper.☆13Dec 29, 2019Updated 6 years ago
- [TPAMI 2024 & CVPR 2023] PyTorch code for DGM4: Detecting and Grounding Multi-Modal Media Manipulation and beyond☆502Apr 23, 2024Updated last year
- This is the code for the paper "Cross-modal Ambiguity Learning for Multimodal Fake News Detection" of WWW2022.☆89Jul 20, 2022Updated 3 years ago
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning☆95Jan 7, 2025Updated last year
- ☆25Feb 6, 2023Updated 3 years ago
- Download Web-10K data by querying Bing Image Search☆10Feb 1, 2022Updated 4 years ago
- Patient data simulator following the structure of an OpenAI Gym.☆11Jul 9, 2019Updated 6 years ago
- GitHub repository for Plot and Rework: Modeling Storylines for Visual Storytelling (ACL-IJCNLP2021 Findings)☆22Aug 22, 2022Updated 3 years ago
- [ACM MM 2024] FKA-Owl: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs☆59Aug 8, 2024Updated last year
- [ACM MM'23] UMMAFormer: A Universal Multimodal-adaptive Transformer Framework For Temporal Forgery Localization☆77Nov 12, 2024Updated last year
- ☆10Jul 23, 2021Updated 4 years ago
- ☆11Sep 7, 2020Updated 5 years ago
- Multi-sense word embeddings from visual co-occurrences☆25Sep 5, 2019Updated 6 years ago
- PyTorch version of VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer (NeurIPS 2021)☆56Feb 6, 2023Updated 3 years ago
- Official repository for "FakeSV: A Multimodal Benchmark with Rich Social Context for Fake News Detection on Short Video Platforms", AAAI …☆167Sep 2, 2024Updated last year
- Learning phrase grounding from captioned images through InfoNCE bound on mutual information☆74Aug 22, 2020Updated 5 years ago
- Pre-trained V+L Data Preparation☆46Jun 2, 2020Updated 5 years ago
- ☆17Sep 2, 2023Updated 2 years ago
- Fact-checking system for textual and visual inputs.☆48Feb 24, 2026Updated last week
- The official code implementation of the paper "Interpretable Multimodal Misinformation Detection with Logic Reasoning", accepted by Finding of…☆31Feb 5, 2026Updated 3 weeks ago
- A task-agnostic vision-language architecture as a step towards General Purpose Vision☆92Jul 14, 2021Updated 4 years ago
- Code for AAAI'24 paper "Rethinking Graph Masked Autoencoders through Alignment and Uniformity".☆14Jun 14, 2024Updated last year
- Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs☆17Mar 2, 2020Updated 6 years ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020☆81Jul 17, 2020Updated 5 years ago
- Referring expression comprehension on ReferIt(RefClef)☆10Nov 28, 2016Updated 9 years ago
- Image classification using deep learning, utilizing both frequency and pixel domain information of images. Implemented MVNN model f…☆19Mar 25, 2023Updated 2 years ago
- Code for the ACL paper "No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling"☆136Jan 19, 2021Updated 5 years ago
- Words and their images in 98 languages☆14Mar 1, 2019Updated 7 years ago
- Generate a denotation graph from a set of image captions☆15Sep 4, 2018Updated 7 years ago
- Video captioning baseline models on Video2Commonsense Dataset.☆57Apr 15, 2021Updated 4 years ago
- FakeAVCeleb☆113Dec 23, 2021Updated 4 years ago
- [ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning☆296Mar 13, 2024Updated last year
- Scene Graph Parsing as Dependency Parsing☆41May 22, 2019Updated 6 years ago