MAGAer13/DeCapBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MAGAer13/DeCapBench)

MAGAer13 / DeCapBench

Official Code for "Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning" (ICLR 2025)

☆14

Alternatives and similar repositories for DeCapBench

Users that are interested in DeCapBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

georgepar / grnet_guide
View on GitHub
Guide for the slp group on how to use the Grnet cluster
☆11Apr 16, 2020Updated 6 years ago
hoangquy18 / Multimodal-Aspect-Category-Sentiment-Analysis
View on GitHub
Dataset and codes for our paper "New Benchmark Dataset and Fine-Grained Cross-Modal Fusion Framework for Vietnamese Multimodal Aspect-Cat…
☆14Dec 14, 2024Updated last year
njucckevin / CapArena
View on GitHub
An Arena-style Automated Evaluation Benchmark for Detailed Captioning
☆59Jun 1, 2025Updated last year
heraclex12 / vietpunc
View on GitHub
Vietnamese Punctuation Prediction using Pretrained Language Models
☆14May 8, 2022Updated 4 years ago
SALT-NLP / PersuationGames
View on GitHub
[ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…
☆16Feb 22, 2025Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
HLTCHKUST / sentiment-lookahead
View on GitHub
Original code for our work on Sentiment Look-ahead.
☆18Apr 27, 2021Updated 5 years ago
kietnv / vireader
View on GitHub
Machine Reading Comprehension has attracted significant interest in research on natural language understanding, and large-scale datasets …
☆10Aug 14, 2021Updated 4 years ago
zxk1212 / OODD
View on GitHub
[CVPR2025] The implementation of the paper "OODD: Test-time Out-of-Distribution Detection with Dynamic Dictionary".
☆19May 9, 2025Updated last year
tarudesu / ViHateT5
View on GitHub
Repository for the paper "ViHateT5: Enhancing Hate Speech Detection in Vietnamese with A Unified Text-to-Text Transformer Model" (ACL'202…
☆12Aug 13, 2024Updated last year
mlip-cmu / mlip-cmu.github.io
View on GitHub
Homepage
☆13Jul 8, 2026Updated last week
mrzjy / sunburst
View on GitHub
A simple Python implementation of ngram sunburst (nested pie chart) visualization showed in CoQA paper
☆14Mar 12, 2019Updated 7 years ago
junjie-shentu / Textual-Localization
View on GitHub
Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation
☆16Mar 10, 2024Updated 2 years ago
seoneun / T5-Question-Generation
View on GitHub
SQuAD Question Generation module based on T5-large
☆18Aug 26, 2022Updated 3 years ago
bino282 / ViNLP
View on GitHub
☆17Oct 30, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
THU-CVML / TextureDiffusion
View on GitHub
[ICASSP 2025 Oral] The official implementation of paper "TextureDiffusion: Target Prompt Disentangled Editing for Various Texture Transfe…
☆17Mar 13, 2025Updated last year
Jiaxuan-Li / EVCap
View on GitHub
[CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension
☆64Apr 8, 2024Updated 2 years ago
LinLLLL / BayesCAL
View on GitHub
The official implementation of Bayesian Cross-modal Alignment Learning for Few-Shot Out-of-Distribution Generalization (AAAI2023).
☆12Oct 13, 2025Updated 9 months ago
kh4nh12 / ViVQA
View on GitHub
☆17Jul 10, 2022Updated 4 years ago
thuy-le-ep / Vietnamese-data
View on GitHub
Include Vietnamese stop words, Vietnamese person names, Vietnam GIS(Geographic Information System) data, Vietnamese Dictionary ...
☆14Oct 18, 2017Updated 8 years ago
kimkim00 / UIT-ViSD4SA
View on GitHub
ViSD4SA, a Vietnamese Span Detection for Aspect-based sentiment analysis dataset
☆17Oct 4, 2022Updated 3 years ago
ngxtnhi / ViLexNorm
View on GitHub
A Lexical Normalization Corpus for Vietnamese Social Media Text
☆20Mar 20, 2024Updated 2 years ago
reds-lab / BEEAR
View on GitHub
This is the official Gtihub repo for our paper: "BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Lang…
☆23Jul 3, 2024Updated 2 years ago
kylehkhsu / tripod
View on GitHub
☆12Apr 19, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
MasterHow / OccFiner
View on GitHub
Offboard Occupancy Refinement with Hybrid Propagation for Autonomous Driving
☆15Feb 10, 2025Updated last year
MarcLafon / gallop
View on GitHub
Adaptation of vision-language models (CLIP) to downstream tasks using local and global prompts.
☆52Jul 10, 2025Updated last year
PostMindLab / ICD
View on GitHub
[ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding
☆18Nov 10, 2025Updated 8 months ago
ivclab / DeepPhotoCritic-ICCV17
View on GitHub
Kuang-Yu Chang, Kung-Hung Lu, and Chu-Song Chen, "Aesthetic Critiques Generation for Photos," International Conference on Computer Vision…
☆18Oct 11, 2022Updated 3 years ago
phungpx / LexiSignVQA
View on GitHub
LexiSignVQA: A Unified Training-free Multi-stage Approach to Multimodal Legal Question Answering on Traffic Sign Rules
☆23Nov 18, 2025Updated 8 months ago
jordddan / GameEval
View on GitHub
Using conversational games to evaluate powerful LLMs
☆18Sep 3, 2023Updated 2 years ago
PLUM-Lab / Mocheg
View on GitHub
Dataset and Code for Multimodal Fact Checking and Explanation Generation (Mocheg)
☆68Nov 24, 2023Updated 2 years ago
HHousen / object-discovery-pytorch
View on GitHub
An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models.
☆15May 26, 2025Updated last year
LuongPhan / UIT-ViSFD
View on GitHub
☆17Oct 15, 2021Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
zihao-ai / vdc
View on GitHub
This is the official implementation of ICLR 2024 paper "VDC: Versatile Data Cleanser based on Visual-Linguistic Inconsistency by Multimod…
☆19Feb 24, 2025Updated last year
google / imageinwords
View on GitHub
Data release for the ImageInWords (IIW) paper.
☆224Nov 17, 2024Updated last year
facebookresearch / dual-system-for-visual-language-reasoning
View on GitHub
Github repo for Peifeng's internship project
☆13Nov 7, 2023Updated 2 years ago
iclr2024mcmi / ICLRMCMI
View on GitHub
Official implementation of Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information
☆12Sep 28, 2023Updated 2 years ago
DataSenseiAryan / TS3000_TheChatBOT
View on GitHub
Its a social networking chat-bot trained on Reddit dataset . It supports open bounded queries developed on the concept of Neural Machine …
☆23Apr 9, 2021Updated 5 years ago
ds4v / 30VNFoods
View on GitHub
An end-to-end implementation process for building, labeling & deploying a dataset with 25136 images of 30 Vietnamese foods & their URLs: …
☆21Nov 16, 2021Updated 4 years ago
dorothy-yao / drfuse
View on GitHub
DrFuse: Learning Disentangled Representation for Clinical Multi-Modal Fusion with Missing Modality and Modal Inconsistency (AAAI24)
☆62Aug 20, 2024Updated last year