gregor-ge / FOCI-BenchmarkLinks
We present **FOCI**, a benchmark for Fine-grained Object ClassIfication for large vision language models (LVLMs).
☆16Updated last year
Alternatives and similar repositories for FOCI-Benchmark
Users that are interested in FOCI-Benchmark are comparing it to the libraries listed below
Sorting:
- Code for T-MARS data filtering☆35Updated last year
- ☆24Updated last year
- ☆11Updated 7 months ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆18Updated last year
- Code for "Merging Text Transformers from Different Initializations"☆20Updated 4 months ago
- Official Code for MIMETIC^2☆12Updated 7 months ago
- Repository for Skill Set Optimization☆13Updated 11 months ago
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …☆22Updated last year
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆23Updated 2 months ago
- ☆37Updated 2 years ago
- Command-line tool for downloading and extending the RedCaps dataset.☆48Updated last year
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆36Updated last year
- implementation of dualformer☆17Updated 3 months ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆34Updated 11 months ago
- Recursive Visual Programming (ECCV 2024)☆17Updated 7 months ago
- ☆26Updated last year
- Efficient Scaling laws and collaborative pretraining.☆16Updated 5 months ago
- ☆10Updated 2 months ago
- ☆20Updated last year
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆16Updated 2 years ago
- This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs☆31Updated 3 months ago
- Official code for the paper: "Metadata Archaeology"☆19Updated 2 years ago
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning☆36Updated 2 years ago
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆20Updated last year
- List of papers on Self-Correction of LLMs.☆73Updated 6 months ago
- ☆27Updated 2 years ago
- ☆31Updated last year
- Lottery Ticket Adaptation☆39Updated 7 months ago
- Byte-sized text games for code generation tasks on virtual environments☆19Updated 11 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆44Updated 4 months ago