keio-smilab24 / PolosView external linksLinks
[CVPR24 Highlights] Polos: Multimodal Metric Learning from Human Feedback for Image Captioning
☆33May 25, 2025Updated 8 months ago
Alternatives and similar repositories for Polos
Users that are interested in Polos are comparing it to the libraries listed below
Sorting:
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…☆16Feb 22, 2025Updated 11 months ago
- Imagen-mini for girl image generation☆12Nov 19, 2022Updated 3 years ago
- [NeurIPS 2023] A faithful benchmark for vision-language compositionality☆89Feb 13, 2024Updated 2 years ago
- ☆17Nov 4, 2022Updated 3 years ago
- [IJCAI 2025] Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives☆29Nov 25, 2025Updated 2 months ago
- Densely Captioned Images (DCI) dataset repository.☆196Jul 1, 2024Updated last year
- PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021☆24Jun 4, 2021Updated 4 years ago
- ☆55May 31, 2022Updated 3 years ago
- ☆26Feb 3, 2023Updated 3 years ago
- Code Base for the work "Interactive Portrait Harmonization"☆28May 11, 2023Updated 2 years ago
- (NeurIPS 2021) Pytorch implementation of paper "Re-ranking for image retrieval and transductive few-shot classification"☆31Nov 21, 2021Updated 4 years ago
- Official Source code of "One-Shot Adaptation of GAN in Just One CLIP" IEEE Transactions on Pattern Anaylsis and Machine Intelligence (TPA…☆65Jun 5, 2023Updated 2 years ago
- ☆30Jan 3, 2023Updated 3 years ago
- L-Verse: Bidirectional Generation Between Image and Text☆107Apr 1, 2025Updated 10 months ago
- This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described …☆71Dec 20, 2021Updated 4 years ago
- ☆66Feb 5, 2024Updated 2 years ago
- Evaluation benchmark for the task of Semantic Image Translation. Contains code to run FlexIT (CVPR 2022)☆34Mar 25, 2022Updated 3 years ago
- NegCLIP.☆38Feb 6, 2023Updated 3 years ago
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training☆141Dec 16, 2025Updated 2 months ago
- Touchstone: Evaluating Vision-Language Models by Language Models☆83Jan 18, 2024Updated 2 years ago
- ☆70Dec 5, 2025Updated 2 months ago
- Data repository for the VALSE benchmark.☆37Feb 15, 2024Updated 2 years ago
- DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception☆159Dec 6, 2024Updated last year
- Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs☆101Oct 23, 2024Updated last year
- Towards Memorization-Free Diffusion Models (CVPR2024) Codebase☆12Jun 2, 2024Updated last year
- The implementation of Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning☆13Apr 14, 2024Updated last year
- ☆12May 26, 2022Updated 3 years ago
- [ICME 2019] Source code and datasets for "Semi-supervised Compatibility Learning Across Categories for Clothing Matching"☆10Apr 26, 2024Updated last year
- Sequential Parameter Optimization in Python☆14Jan 12, 2026Updated last month
- Microsoft question-answering dataset☆10Jun 16, 2023Updated 2 years ago
- A collection of interesting papers on Diffusion Models☆15Dec 19, 2023Updated 2 years ago
- 「賞金で二郎一生分食べたい!」チームのレポジトリです.☆11Dec 9, 2021Updated 4 years ago
- A list of papers and other resources on language-guided image editing.☆39Jan 13, 2021Updated 5 years ago
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)☆104Dec 9, 2024Updated last year
- Official Pytorch repo of CVPR'23 and NeurIPS'23 papers on understanding replication in diffusion models.☆113Nov 22, 2023Updated 2 years ago
- The official repository of MM-R5☆28Jun 22, 2025Updated 7 months ago
- ☆11Oct 2, 2024Updated last year
- ☆10Dec 21, 2020Updated 5 years ago
- [ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]☆15Jul 15, 2025Updated 7 months ago