Official Code for "Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning" (ICLR 2025)
☆12Mar 6, 2025Updated last year
Alternatives and similar repositories for DeCapBench
Users that are interested in DeCapBench are comparing it to the libraries listed below
Sorting:
- Guide for the slp group on how to use the Grnet cluster☆11Apr 16, 2020Updated 5 years ago
- An Arena-style Automated Evaluation Benchmark for Detailed Captioning☆58Jun 1, 2025Updated 9 months ago
- Dataset and codes for our paper "New Benchmark Dataset and Fine-Grained Cross-Modal Fusion Framework for Vietnamese Multimodal Aspect-Cat…☆14Dec 14, 2024Updated last year
- Official implementation of REArtGS (NeurIPS 2025)☆19Mar 6, 2026Updated 2 weeks ago
- Vietnamese Punctuation Prediction using Pretrained Language Models☆14May 8, 2022Updated 3 years ago
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…☆16Feb 22, 2025Updated last year
- Machine Reading Comprehension has attracted significant interest in research on natural language understanding, and large-scale datasets …☆10Aug 14, 2021Updated 4 years ago
- [CVPR2025] The implementation of the paper "OODD: Test-time Out-of-Distribution Detection with Dynamic Dictionary".☆18May 9, 2025Updated 10 months ago
- Homepage☆13Dec 20, 2025Updated 3 months ago
- A simple Python implementation of ngram sunburst (nested pie chart) visualization showed in CoQA paper☆13Mar 12, 2019Updated 7 years ago
- The official implementation of Vision-Language Alignment Learning under Affinity and Divergence Principles for Few-Shot Out-of-Distributi…☆29Jun 18, 2024Updated last year
- Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation☆16Mar 10, 2024Updated 2 years ago
- Original code for our work on Sentiment Look-ahead.☆18Apr 27, 2021Updated 4 years ago
- SQuAD Question Generation module based on T5-large☆17Aug 26, 2022Updated 3 years ago
- ☆17Oct 30, 2022Updated 3 years ago
- [CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension☆62Apr 8, 2024Updated last year
- The official implementation of Bayesian Cross-modal Alignment Learning for Few-Shot Out-of-Distribution Generalization (AAAI2023).☆20Oct 13, 2025Updated 5 months ago
- [ICASSP 2025 Oral] The official implementation of paper "TextureDiffusion: Target Prompt Disentangled Editing for Various Texture Transfe…☆16Mar 13, 2025Updated last year
- ☆17Jul 10, 2022Updated 3 years ago
- Include Vietnamese stop words, Vietnamese person names, Vietnam GIS(Geographic Information System) data, Vietnamese Dictionary ...☆15Oct 18, 2017Updated 8 years ago
- The official implementation of CRoFT: Robust Fine-Tuning with Concurrent Optimization for OOD Generalization and Open-Set OOD Detection (…☆36Jun 17, 2024Updated last year
- A Lexical Normalization Corpus for Vietnamese Social Media Text☆20Mar 20, 2024Updated 2 years ago
- [ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding☆15Nov 10, 2025Updated 4 months ago
- ☆12Apr 19, 2024Updated last year
- Offboard Occupancy Refinement with Hybrid Propagation for Autonomous Driving☆16Feb 10, 2025Updated last year
- ViSD4SA, a Vietnamese Span Detection for Aspect-based sentiment analysis dataset☆17Oct 4, 2022Updated 3 years ago
- Adaptation of vision-language models (CLIP) to downstream tasks using local and global prompts.☆51Jul 10, 2025Updated 8 months ago
- This is the official Gtihub repo for our paper: "BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Lang…☆22Jul 3, 2024Updated last year
- Kuang-Yu Chang, Kung-Hung Lu, and Chu-Song Chen, "Aesthetic Critiques Generation for Photos," International Conference on Computer Vision…☆18Oct 11, 2022Updated 3 years ago
- [CVPR2025] Code Release of Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception☆21Jun 17, 2025Updated 9 months ago
- Using conversational games to evaluate powerful LLMs☆18Sep 3, 2023Updated 2 years ago
- ☆35Feb 4, 2026Updated last month
- This is the official implementation of ICLR 2024 paper "VDC: Versatile Data Cleanser based on Visual-Linguistic Inconsistency by Multimod…☆19Feb 24, 2025Updated last year
- An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models.☆14May 26, 2025Updated 9 months ago
- Dataset and Code for Multimodal Fact Checking and Explanation Generation (Mocheg)☆63Nov 24, 2023Updated 2 years ago
- An end-to-end implementation process for building, labeling & deploying a dataset with 25136 images of 30 Vietnamese foods & their URLs: …☆21Nov 16, 2021Updated 4 years ago
- Data release for the ImageInWords (IIW) paper.☆227Nov 17, 2024Updated last year
- DL Backtrace is a new explainablity technique for deep learning models that works for any modality and model type.☆24Feb 16, 2026Updated last month
- Official implementation of Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information☆11Sep 28, 2023Updated 2 years ago