[CVPR24 Highlights] Polos: Multimodal Metric Learning from Human Feedback for Image Captioning
☆34May 25, 2025Updated 11 months ago
Alternatives and similar repositories for Polos
Users that are interested in Polos are comparing it to the libraries listed below.
- Imagen-mini for girl image generation☆12Nov 19, 2022Updated 3 years ago
- This is the official implementation of the paper "MM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision…☆32Mar 12, 2024Updated 2 years ago
- [NeurIPS 2023] A faithful benchmark for vision-language compositionality☆92Feb 13, 2024Updated 2 years ago
- Data release for the ImageInWords (IIW) paper.☆225Nov 17, 2024Updated last year
- [ECCV24] Layer-Wise Relevance Propagation with Conservation Property for ResNet☆15Sep 20, 2024Updated last year
- LLaVA-JP is a Japanese VLM trained by LLaVA method☆64Jul 3, 2024Updated last year
- [IJCAI 2025] Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives☆34Nov 25, 2025Updated 5 months ago
- PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021☆24Jun 4, 2021Updated 4 years ago
- ☆18Sep 13, 2023Updated 2 years ago
- This repository contains the data and code of the paper titled "IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language M…☆24Apr 27, 2025Updated last year
- ☆13Oct 22, 2024Updated last year
- Microsoft question-answering dataset☆10Jun 16, 2023Updated 2 years ago
- ☆65Feb 5, 2024Updated 2 years ago
- ☆47Aug 26, 2025Updated 8 months ago
- ☆11Jun 28, 2019Updated 6 years ago
- Official implementation of TagAlign☆37Dec 11, 2024Updated last year
- Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurI…☆94Apr 29, 2024Updated 2 years ago
- ☆30Jan 3, 2023Updated 3 years ago
- Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR …☆295Jun 7, 2023Updated 2 years ago
- Code and data for ImageCoDe, a contextual vision-and-language benchmark☆41Mar 1, 2024Updated 2 years ago
- The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021)☆16Jan 2, 2023Updated 3 years ago
- ☆191Oct 28, 2024Updated last year
- Repo from the "Learning with limited labeled data" seminar @ Uni of Tuebingen. A collection of notes, notebooks and slideshows to underst…☆17Apr 13, 2023Updated 3 years ago
- Source codes of "Exploring the Potential of Unsupervised Image Synthesis for SAR-Optical Image Matching" IEEE Access☆15May 5, 2021Updated 5 years ago
- (ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator☆114Mar 21, 2025Updated last year
- The official implementation of the paper code for "Unsupervised Multi-Modal Remote Sensing Image Registration via Domain Adaptation".☆11May 21, 2024Updated last year
- (NeurIPS 2021) Pytorch implementation of paper "Re-ranking for image retrieval and transductive few-shot classification"☆31Nov 21, 2021Updated 4 years ago
- [ICME 2019] Source code and datasets for "Semi-supervised Compatibility Learning Across Categories for Clothing Matching"☆11Apr 26, 2024Updated 2 years ago
- ☆17Nov 4, 2022Updated 3 years ago
- ☆30Sep 12, 2022Updated 3 years ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆10Feb 27, 2024Updated 2 years ago
- PyTorch implementation of "PatchVAE: Learning Local Latent Codes for Recognition" to appear in CVPR 2020☆14Apr 9, 2020Updated 6 years ago
- Code for DVD: A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue☆14Oct 12, 2021Updated 4 years ago
- Official Code for ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users (NeurIPS 2024)☆24Oct 23, 2024Updated last year
- [ACL 2023 Findings] FACTUAL dataset, the textual scene graph parser trained on FACTUAL.☆127May 5, 2026Updated 2 weeks ago
- M-HalDetect Dataset Release☆29Nov 4, 2023Updated 2 years ago
- ☆15May 13, 2024Updated 2 years ago
- [TPAMI 2023] Object Affinity Learning: Towards Annotation-free Instance Segmentation☆14Sep 14, 2023Updated 2 years ago
- A list of papers and other resources on language-guided image editing.☆39Jan 13, 2021Updated 5 years ago