edchengg/infoseek_eval

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/edchengg/infoseek_eval)

edchengg / infoseek_eval

EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions

☆26

Alternatives and similar repositories for infoseek_eval

Users that are interested in infoseek_eval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

edchengg / oven_eval
View on GitHub
ICCV 2023 (Oral) Open-domain Visual Entity Recognition Towards Recognizing Millions of Wikipedia Entities
☆44Jun 7, 2025Updated last year
open-vision-language / infoseek
View on GitHub
☆78Oct 27, 2023Updated 2 years ago
open-vision-language / oven
View on GitHub
☆47Aug 15, 2023Updated 2 years ago
MrZilinXiao / AutoVER
View on GitHub
[ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.
☆14Mar 2, 2024Updated 2 years ago
LinWeizheDragon / FLMR
View on GitHub
The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.
☆108May 30, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
lezhang7 / MOQAGPT
View on GitHub
[EMNLP'2023 Findings] MoqaGPT, for zero-shot multimodal question answering with LLMs
☆13Dec 28, 2024Updated last year
Go2Heart / EchoSight
View on GitHub
[EMNLP 2024 Findings] The official PyTorch implementation of EchoSight: Advancing Visual-Language Models with Wiki Knowledge.
☆90Jan 19, 2026Updated 6 months ago
multimodal-art-projection / IV-Bench
View on GitHub
☆14Apr 23, 2025Updated last year
THU-KEG / Event-Level-Knowledge-Editing
View on GitHub
☆12Apr 25, 2024Updated 2 years ago
HITsz-TMG / SKURG
View on GitHub
☆20Nov 4, 2023Updated 2 years ago
aimagelab / ReflectiVA
View on GitHub
[CVPR 2025] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering
☆56Jul 14, 2025Updated last year
allenai / unifew
View on GitHub
Unifew: Unified Fewshot Learning Model
☆18Sep 10, 2021Updated 4 years ago
snap-research / MyVLM
View on GitHub
Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)
☆188Jul 5, 2024Updated 2 years ago
alexandrosXe / A-Simple-Baseline-For-Knowledge-Based-VQA
View on GitHub
Repo for the EMNLP 2023 paper "A Simple Knowledge-Based Visual Question Answering"
☆25Dec 14, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
BolinLai / LEGO
View on GitHub
[ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation…
☆41Feb 24, 2025Updated last year
SciMT / SciMT-benchmark
View on GitHub
☆11Jan 3, 2024Updated 2 years ago
LinWeizheDragon / Retrieval-Augmented-Visual-Question-Answering
View on GitHub
This is the official repository for Retrieval Augmented Visual Question Answering
☆252Dec 19, 2024Updated last year
mainaksingha01 / ODG-CLIP
View on GitHub
☆21Oct 9, 2025Updated 9 months ago
PaulLerner / ViQuAE
View on GitHub
Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev…
☆39Dec 19, 2024Updated last year
Hoar012 / TDC-Video
View on GitHub
Official implementation of TDC.
☆15Jul 22, 2025Updated last year
jocelyn2002 / NJUAI_CodingHW
View on GitHub
A collection of my own homework codes, mainly from School of Artificial Intelligence, NJU.
☆13May 25, 2021Updated 5 years ago
luka-group / vlm-knowledge-conflict
View on GitHub
Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."
☆54Oct 19, 2024Updated last year
OVAD-Benchmark / ovad-benchmark-code
View on GitHub
OVAD: Open-vocabulary Attribute Detection code
☆30Aug 28, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
runchu-tian / LongPiBench
View on GitHub
The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"
☆14Dec 16, 2024Updated last year
yasdel / mmrecsys_survey20
View on GitHub
☆12Nov 11, 2022Updated 3 years ago
Shimorina / relation-extraction-db-wikidata
View on GitHub
☆11Jul 17, 2022Updated 4 years ago
pkunlp-icler / MIC
View on GitHub
MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU
☆49Jul 13, 2025Updated last year
ronghanghu / vqa-maskrcnn-benchmark-m4c
View on GitHub
Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…
☆13Jan 30, 2020Updated 6 years ago
vacancy / PDSketch-Alpha-Release
View on GitHub
☆17Nov 1, 2023Updated 2 years ago
zcai0612 / InstantBooth
View on GitHub
My implement of InstantBooth
☆14Sep 11, 2023Updated 2 years ago
tmlr-group / WCA
View on GitHub
[ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models"
☆59Sep 3, 2024Updated last year
vl-illusion / GVIL
View on GitHub
Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"
☆15Jan 25, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kamperh / speech_correspondence
View on GitHub
Correspondence and autoencoder neural network training for speech using Pylearn2.
☆14Dec 9, 2015Updated 10 years ago
guilk / KAT
View on GitHub
Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"
☆71Jul 11, 2022Updated 4 years ago
due-benchmark / du-schema
View on GitHub
JSON Schema format for storing datasets details, documents processed contents, and documents annotations in the document understanding do…
☆14Nov 5, 2024Updated last year
edchengg / easyproject
View on GitHub
ACL 2023 (Findings) End-to-end Cross-lingual Label Project
☆15Nov 24, 2023Updated 2 years ago
learn2phoenix / cvpr22_vplow_ow
View on GitHub
☆12May 19, 2023Updated 3 years ago
chunchiehy / musst
View on GitHub
Multi-span Style Extraction for Generative Reading Comprehension
☆10Apr 2, 2021Updated 5 years ago
Jiaxuan-Li / EVCap
View on GitHub
[CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension
☆64Apr 8, 2024Updated 2 years ago