open-vision-language/oven

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/open-vision-language/oven)

open-vision-language / oven

☆47

Alternatives and similar repositories for oven

Users that are interested in oven are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

edchengg / oven_eval
View on GitHub
ICCV 2023 (Oral) Open-domain Visual Entity Recognition Towards Recognizing Millions of Wikipedia Entities
☆44Jun 7, 2025Updated last year
open-vision-language / infoseek
View on GitHub
☆78Oct 27, 2023Updated 2 years ago
edchengg / infoseek_eval
View on GitHub
EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions
☆26May 30, 2024Updated 2 years ago
google-research-datasets / maxm
View on GitHub
MaXM is a suite of test-only benchmarks for multilingual visual question answering in 7 languages: English (en), French (fr), Hindi (hi),…
☆13Jan 16, 2024Updated 2 years ago
TIGER-AI-Lab / UniIR
View on GitHub
Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)
☆183Oct 1, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
PaulLerner / ViQuAE
View on GitHub
Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev…
☆39Dec 19, 2024Updated last year
wangxw5 / wikiDiverse
View on GitHub
☆39Feb 28, 2023Updated 3 years ago
taoszhang / MMhops-R1
View on GitHub
MMhops-R1: Multimodal Multi-hop Reasoning
☆16Feb 28, 2026Updated 4 months ago
ZUCC-AI / UMIE
View on GitHub
Code and model for AAAI 2024: UMIE: Unified Multimodal Information Extraction with Instruction Tuning
☆48Jun 5, 2024Updated 2 years ago
THU-KEG / Event-Level-Knowledge-Editing
View on GitHub
☆12Apr 25, 2024Updated 2 years ago
ctongfei / hierarchical-typing
View on GitHub
Hierarchical entity typing via multi-level learning to rank
☆12Oct 13, 2020Updated 5 years ago
uvavision / SyViC
View on GitHub
[ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data
☆13Sep 30, 2023Updated 2 years ago
drogozhang / Fine-grained-Entity-Typing-Papers
View on GitHub
Must-read papers on Fine-grained Entity Typing
☆19Jul 7, 2022Updated 4 years ago
HITsz-TMG / ICL-State-Vector
View on GitHub
☆12Jul 4, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
TIGER-AI-Lab / GenAI-Arena
View on GitHub
Interface for GenAI-Arena [NeurIPS24]
☆16Feb 27, 2024Updated 2 years ago
google-research-datasets / 2.5vrd
View on GitHub
This dataset contains about 110k images annotated with the depth and occlusion relationships between arbitrary objects. It enables resear…
☆16Apr 28, 2021Updated 5 years ago
kleinercubs / ImgFact
View on GitHub
Beyond Entities: A Large-Scale Multi-Modal Knowledge Graph with Triplet Fact Grounding
☆11May 23, 2024Updated 2 years ago
allenai / aokvqa
View on GitHub
Official repository for the A-OKVQA dataset
☆117May 8, 2024Updated 2 years ago
WebQnA / WebQA
View on GitHub
☆68Jan 3, 2025Updated last year
yonatanbitton / wysiwyr
View on GitHub
☆37Oct 7, 2023Updated 2 years ago
aimagelab / PMA-Net
View on GitHub
[ICCV 2023] With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning.
☆19Jun 7, 2024Updated 2 years ago
drogozhang / Criminal-Intelligence-QA-System
View on GitHub
Demo for advanced Java final project in 18-19 1 of Canghong Jin
☆25Nov 18, 2018Updated 7 years ago
LinWeizheDragon / FLMR
View on GitHub
The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.
☆108May 30, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
yasumasaonoe / Box4Types
View on GitHub
☆25Aug 31, 2023Updated 2 years ago
google-research / pactran_metrics
View on GitHub
☆14Mar 24, 2023Updated 3 years ago
baopj / Vid-Morp
View on GitHub
☆12Dec 6, 2024Updated last year
nguyenthanhhy0108 / Web-Personal-Project-BE
View on GitHub
Software Engineering Back End Microservices Project
☆15Nov 20, 2024Updated last year
baopj / E3M
View on GitHub
[ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.
☆11Jul 16, 2024Updated 2 years ago
Go2Heart / EchoSight
View on GitHub
[EMNLP 2024 Findings] The official PyTorch implementation of EchoSight: Advancing Visual-Language Models with Wiki Knowledge.
☆90Jan 19, 2026Updated 6 months ago
aws / aws-refcocog-adv
View on GitHub
☆22Jan 14, 2026Updated 6 months ago
ljang0 / videowebarena
View on GitHub
☆14Dec 25, 2024Updated last year
ethanm88 / llm-access-control
View on GitHub
Official Repository for Can Language Models be Instructed to Protect Personal Information?
☆14Oct 8, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
baaaad / ECE
View on GitHub
[ECCV'22 Poster] Explicit Image Caption Editing
☆22Nov 30, 2022Updated 3 years ago
google-deepmind / magiclens
View on GitHub
[ICML'24 Oral] "MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions"
☆211Oct 28, 2024Updated last year
junyangwang0410 / HaELM
View on GitHub
An automatic MLLM hallucination detection framework
☆19Sep 26, 2023Updated 2 years ago
Nithin-Holla / MetaWSD
View on GitHub
Repository containing code for the paper "Learning to Learn to Disambiguate: Meta-Learning for Few-Shot Word Sense Disambiguation", publi…
☆12Nov 12, 2020Updated 5 years ago
phddamuge / UniRPG
View on GitHub
This is code for the EMNLP 2022 Paper "UniRPG: Unified Discrete Reasoning over Table and Text as Program Generation".
☆10Apr 30, 2023Updated 3 years ago
MMBrowseComp / MM-BrowseComp
View on GitHub
☆70Jan 4, 2026Updated 6 months ago
uwnlp / taggerflow
View on GitHub
Training code for LSTM CCG Parsing
☆25Dec 2, 2016Updated 9 years ago