gregor-ge / FOCI-BenchmarkLinks
We present **FOCI**, a benchmark for Fine-grained Object ClassIfication for large vision language models (LVLMs).
☆16Updated last year
Alternatives and similar repositories for FOCI-Benchmark
Users that are interested in FOCI-Benchmark are comparing it to the libraries listed below
Sorting:
- Official Code for MIMETIC^2☆12Updated 7 months ago
- Code for T-MARS data filtering☆35Updated last year
- Official code for the paper: "Metadata Archaeology"☆19Updated 2 years ago
- codebase for the SIMAT dataset and evaluation☆38Updated 3 years ago
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning☆36Updated 2 years ago
- ☆34Updated 2 years ago
- A Pytorch implementation of Attention on Attention module (both self and guided variants), for Visual Question Answering☆43Updated 4 years ago
- Command-line tool for downloading and extending the RedCaps dataset.☆48Updated last year
- This dataset contains about 110k images annotated with the depth and occlusion relationships between arbitrary objects. It enables resear…☆16Updated 4 years ago
- Un-*** 50 billions multimodality dataset☆23Updated 2 years ago
- ☆26Updated 3 years ago
- Load any clip model with a standardized interface☆21Updated last year
- Recursive Visual Programming (ECCV 2024)☆17Updated 7 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated 2 weeks ago
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"☆29Updated last year
- Directed masked autoencoders☆14Updated 2 years ago
- ☆24Updated 2 years ago
- Developing adversarial examples and showing their semantic generalization for the OpenAI CLIP model (https://github.com/openai/CLIP)☆26Updated 4 years ago
- Code for "Merging Text Transformers from Different Initializations"☆20Updated 5 months ago
- ☆13Updated 2 years ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆34Updated last year
- reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data"☆17Updated 9 months ago
- ☆32Updated 3 years ago
- ☆1Updated last year
- Lottery Ticket Adaptation☆39Updated 7 months ago
- ☆13Updated 10 months ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated last month
- Describe the format of image/text datasets☆11Updated 3 years ago
- Google Research☆46Updated 2 years ago
- Official code and data for NeurIPS 2023 paper "ImageNet-Hard: The Hardest Images Remaining from a Study of the Power of Zoom and Spatial …☆39Updated last year