[ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"
☆13Jun 11, 2023Updated 2 years ago
Alternatives and similar repositories for fine-grained-evals
Users that are interested in fine-grained-evals are comparing it to the libraries listed below
Sorting:
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"☆49Dec 7, 2022Updated 3 years ago
- A multi-task learning approach for conditioned response generation (NAACL 2021)☆12Nov 18, 2022Updated 3 years ago
- CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training☆34Nov 9, 2021Updated 4 years ago
- This repo contains codes and instructions for baselines in the VLUE benchmark.☆41Jul 16, 2022Updated 3 years ago
- ☆12Mar 13, 2025Updated last year
- pytorch implementation of mvp: a multi-stage vision-language pre-training framework☆11Apr 23, 2022Updated 3 years ago
- ☆12Feb 14, 2023Updated 3 years ago
- ☆45Aug 14, 2023Updated 2 years ago
- ☆14Feb 23, 2026Updated 3 weeks ago
- This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm…☆11Oct 9, 2024Updated last year
- A Generative Dialogue State Tracking Model☆22Jun 24, 2021Updated 4 years ago
- The code of MGCC: Text-based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning☆20Feb 26, 2025Updated last year
- [ICCV 2025] Diffusion Curriculum (DisCL)☆18Sep 26, 2025Updated 5 months ago
- ☆14Jul 13, 2021Updated 4 years ago
- An experiment in declaratively programming parallel pipelines of state machines.☆18Mar 20, 2023Updated 3 years ago
- Repository to storage the 4mula dataset☆10Sep 1, 2021Updated 4 years ago
- Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning☆20Dec 21, 2023Updated 2 years ago
- This is the official code implementation of Bongard-OpenWorld (ICLR 2024).☆14Jan 6, 2025Updated last year
- Generalized Convolution and Efficient Language Recognition☆18Jul 20, 2019Updated 6 years ago
- Official release code of RPG-PALM(ICCV2023)☆14Jul 28, 2023Updated 2 years ago
- The official implementation of ADDP (ICLR 2024)☆12Mar 27, 2024Updated last year
- FedCMR: Federated Cross-Modal Retrieval 的代码(the official implementation of FedCMR: Federated Cross-Modal Retrieval)☆17Oct 17, 2025Updated 5 months ago
- Do notation in Python.☆10Feb 22, 2021Updated 5 years ago
- visual question answering prompting recipes for large vision-language models☆28Sep 14, 2024Updated last year
- being the lecture materials and exercises for the 2016/17 session of Advanced Functional Programming at Strathclyde☆13May 9, 2017Updated 8 years ago
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment☆35Jul 1, 2024Updated last year
- Various palmprint feature extraction techniques: CompCode, Local Tetra Patterns, RLOC☆10Sep 4, 2019Updated 6 years ago
- Implementation of papers in 101 lines of code.☆18Nov 12, 2023Updated 2 years ago
- EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models☆11Jul 16, 2024Updated last year
- The official PyTorch implementation of Logical Consistency and Greater Descriptive Power for Facial Hair Attribute Learning - CVPR 2023☆12Aug 31, 2024Updated last year
- [IJCAI 2023] Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment☆53Apr 9, 2024Updated last year
- Git repository for the course Logika v računalništvu☆15Apr 5, 2022Updated 3 years ago
- ☆82Jul 31, 2023Updated 2 years ago
- Official repository for Towards Multi-modal Transformers in Federated Learning (ECCV2024)☆21Feb 4, 2025Updated last year
- Official Implementation for MoPE (T-MM 2025)☆28Oct 10, 2025Updated 5 months ago
- A paper list that includes world models or generative video models for embodied agents.☆26Jan 17, 2025Updated last year
- Unofficial pixabay python API client☆13Feb 6, 2023Updated 3 years ago
- Egocentric Video Understanding Dataset (EVUD)☆33Jul 4, 2024Updated last year
- Official Implementation of paper "Multimodal Federated Learning with Missing Modality via Prototype Mask and Contrast"☆25Oct 22, 2024Updated last year