[IJCAI 2025] Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives
☆32Nov 25, 2025Updated 4 months ago
Alternatives and similar repositories for awesome-captioning-evaluation
Users that are interested in awesome-captioning-evaluation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation☆67Jul 29, 2025Updated 8 months ago
- ☆17Feb 20, 2025Updated last year
- [ICCV 2023] With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning.☆19Jun 7, 2024Updated last year
- [BMVC 2024 Oral ✨] Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization☆20Sep 11, 2024Updated last year
- ☆10Sep 2, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 我在校园的各项API,自动运行脚本,支持多人☆12Jun 28, 2022Updated 3 years ago
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Dec 23, 2023Updated 2 years ago
- [CVPR 2025] Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval☆36Sep 12, 2025Updated 6 months ago
- This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV …☆25Dec 4, 2025Updated 3 months ago
- Microsoft question-answering dataset☆10Jun 16, 2023Updated 2 years ago
- The Easiest Way to Run Commands as Systemd Services☆10Aug 27, 2025Updated 7 months ago
- [CVPR 2025] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering☆54Jul 14, 2025Updated 8 months ago
- [ECCV'24] Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities☆52Jul 2, 2025Updated 8 months ago
- An Arena-style Automated Evaluation Benchmark for Detailed Captioning☆58Jun 1, 2025Updated 9 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Crop growth stage modeling and classification☆11Apr 11, 2019Updated 6 years ago
- Project page for the 'CLAWS: Clustering Assisted Weakly Supervised Learning with Normalcy Suppression for Anomalous Event Detection', ECC…☆12May 29, 2021Updated 4 years ago
- [CVPR24 Highlights] Polos: Multimodal Metric Learning from Human Feedback for Image Captioning☆33May 25, 2025Updated 10 months ago
- [IEEE TMM 2023] This is the official repo of the paper "Perceptual Quality Improvement in Videoconferencing using Keyframes-based GAN".☆17Dec 10, 2024Updated last year
- [NO LONGER MAINTAINED, SUPERSEDED BY https://github.com/trueagi-io/pln-experimental and https://github.com/trueagi-io/PLN]. Probabilisti…☆16Sep 20, 2025Updated 6 months ago
- FeelingBlue: A Corpus for Understanding the Emotional Connotation of Color in Context, accepted at TACL 2022, presented at ACL 2023☆13Dec 28, 2023Updated 2 years ago
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.☆14Mar 2, 2024Updated 2 years ago
- ☆18Oct 5, 2023Updated 2 years ago
- Using CNN for classifying 101 different food categories - using VGG16, Alex Net and SVM☆10Jan 6, 2020Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- SOMOSPIE (Soil Moisture Spatial Inference Engine) consists of a Jupyter Notebook and a suite of machine learning methods to process input…☆16Sep 16, 2025Updated 6 months ago
- Framewise online action recognition using 4D data☆13Dec 3, 2019Updated 6 years ago
- A currency rate converter App.☆15Sep 5, 2019Updated 6 years ago
- ☆19Jul 23, 2024Updated last year
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h…☆63May 9, 2025Updated 10 months ago
- S-GEAR: Semantically Guided Representation Learning For Action Anticipation☆15Apr 29, 2025Updated 11 months ago
- TUI application for viewing the status of GPU allocations on a Slurm cluster☆11Dec 11, 2023Updated 2 years ago
- Code for the ACL 2019 paper "Observing Dialogue in Therapy: Categorizing and Forecasting Behavioral Codes"☆14Jun 11, 2022Updated 3 years ago
- Presents an optimised Sentinel-1-based Soil Moisture estimation workflow on tilted topography with permanent vegetation cover☆16Aug 18, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆13Jul 31, 2023Updated 2 years ago
- ☆48Jan 27, 2026Updated 2 months ago
- The implementation for "DEER: Descriptive Knowledge Graph for Explaining Entity Relationships" (EMNLP '22)☆13Oct 31, 2022Updated 3 years ago
- Training A Small Emotional Vision Language Model for Visual Art Comprehension☆16Jul 26, 2024Updated last year
- Official code for the paper "Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-…☆22May 11, 2025Updated 10 months ago
- 毕业设计-容器管理和监控平台☆10Jun 14, 2022Updated 3 years ago
- ☆13Jul 17, 2024Updated last year