ICCV 2023 (Oral) Open-domain Visual Entity Recognition: Towards Recognizing Millions of Wikipedia Entities
☆44Jun 7, 2025Updated last year
Alternatives and similar repositories for oven_eval
Users that are interested in oven_eval are comparing it to the libraries listed below.
- EMNLP 2023 - InfoSeek: A New VQA Benchmark Focused on Visual Info-Seeking Questions☆26May 30, 2024Updated 2 years ago
- ☆47Aug 15, 2023Updated 2 years ago
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.☆14Mar 2, 2024Updated 2 years ago
- ☆78Oct 27, 2023Updated 2 years ago
- ACL 2023 (Findings) End-to-end Cross-lingual Label Projection☆15Nov 24, 2023Updated 2 years ago
- Evaluate robustness of adaptation methods on large vision-language models☆19Aug 23, 2023Updated 2 years ago
- Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)☆182Oct 1, 2024Updated last year
- ☆14Apr 23, 2025Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25May 14, 2026Updated last month
- Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)☆148Jan 5, 2026Updated 5 months ago
- EANN(Pytorch)☆10Mar 12, 2022Updated 4 years ago
- ☆12Apr 25, 2024Updated 2 years ago
- QSDK: NB-IoT module driver code written by Hebei Qishi Intelligent Technology Co., Ltd. (河北骑士智能科技有限公司) on top of the RT-Thread RTOS☆10Nov 12, 2019Updated 6 years ago
- Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)☆188Jul 5, 2024Updated last year
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆92Nov 15, 2024Updated last year
- The official repository for AAAI 2024 Oral paper "Structured Probabilistic Coding".☆13Sep 7, 2024Updated last year
- [NLPCC'23] ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles PyTorch Implementation☆14Oct 7, 2023Updated 2 years ago
- [ICML'24 Oral] "MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions"☆209Oct 28, 2024Updated last year
- [ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models"☆59Sep 3, 2024Updated last year
- Official repository for the A-OKVQA dataset☆116May 8, 2024Updated 2 years ago
- a multimodal retrieval dataset☆25Jul 8, 2023Updated 2 years ago
- [ICCV 2023] With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning.☆19Jun 7, 2024Updated 2 years ago
- This is an official implementation of GRIT-VLP☆20Aug 8, 2022Updated 3 years ago
- E5-V: Universal Embeddings with Multimodal Large Language Models☆275Dec 10, 2025Updated 6 months ago
- Dataset and starting code for visual entailment dataset☆123Apr 21, 2022Updated 4 years ago
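- Example of connecting the Pandora development board (jointly produced by RT-Thread and ALIENTEK) to OneNET, using the AP6181 WiFi module and the OneNET EDP protocol to implement data reporting and command delivery control☆16Mar 8, 2020Updated 6 years ago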
- LLaVA-NeXT-Image-Llama3-Lora, Modified from https://github.com/arielnlee/LLaVA-1.6-ft☆47Jul 17, 2024Updated last year
- Code for our ICLR 2024 paper "PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts"☆80May 5, 2024Updated 2 years ago
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training☆141Dec 16, 2025Updated 6 months ago
- [BMVC 2023] Zero-shot Composed Text-Image Retrieval☆55Nov 26, 2024Updated last year
- Some papers about *diverse* image (a few videos) captioning☆25Apr 4, 2023Updated 3 years ago
- Baidu speech recognition based on RT-Thread☆14Dec 9, 2019Updated 6 years ago
- [ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation…☆41Feb 24, 2025Updated last year
- Remote-controlled smart car with a Pandora board as the main controller (RT-Thread-v3.1.3) and a Mecanum omnidirectional wheel chassis☆15Mar 8, 2020Updated 6 years ago
- Official PyTorch implementation of "MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks"☆16Dec 4, 2025Updated 7 months ago
- Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024☆21May 30, 2024Updated 2 years ago
- [ICCV2023] Official code for "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control"☆53Sep 21, 2023Updated 2 years ago
- Big-Interleaved-Dataset☆59Jan 21, 2023Updated 3 years ago
- Code and data for "Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning" (EMNLP 2021).☆29Sep 4, 2021Updated 4 years ago