naver-ai / tablevqabenchView external linksLinks
☆45May 21, 2024Updated last year
Alternatives and similar repositories for tablevqabench
Users that are interested in tablevqabench are comparing it to the libraries listed below
Sorting:
- Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023☆109Oct 24, 2023Updated 2 years ago
- Official Implementation of SCOB [ICCV 2023]☆23Nov 16, 2023Updated 2 years ago
- Evaluating Multimodal Generative AI with Korean Educational Standards, NAACL 2025.☆24May 15, 2025Updated 9 months ago
- [ECCV2024] Official implementation of paper, "DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs".☆156Aug 8, 2024Updated last year
- Official PyTorch implementation of Extract Free Dense Misalignment from CLIP (AAAI'25)☆25Apr 20, 2025Updated 9 months ago
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023☆46Jun 11, 2024Updated last year
- Parse LaTeX math expressions☆30Oct 22, 2024Updated last year
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.☆32Feb 26, 2025Updated 11 months ago
- Official PyTorch Implementation of "Rethinking HTG Evaluation: Bridging Generation and Recognition" (Oral) - 1st Workshop on Critical Eva…☆17Sep 23, 2024Updated last year
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Jul 27, 2021Updated 4 years ago
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆140Apr 22, 2025Updated 9 months ago
- On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, …☆19Dec 16, 2024Updated last year
- [EMNLP’24 Main] Encoding and Controlling Global Semantics for Long-form Video Question Answering☆18Oct 9, 2024Updated last year
- ☆124May 28, 2024Updated last year
- ☆78Aug 7, 2023Updated 2 years ago
- Render documents on a virtual paper with folds and other types of damage using blender geometry nodes.☆26Aug 14, 2023Updated 2 years ago
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". …☆62Nov 5, 2024Updated last year
- EMNLP 2024 Findings "Schema-Driven Information Extraction from Heterogeneous Tables"☆26Dec 5, 2024Updated last year
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(…☆325Oct 14, 2025Updated 4 months ago
- ☆27Dec 29, 2023Updated 2 years ago
- ☆161Dec 27, 2022Updated 3 years ago
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)☆57Mar 31, 2025Updated 10 months ago
- ☆69Jan 9, 2024Updated 2 years ago
- This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs☆36Mar 9, 2025Updated 11 months ago
- Training code for CLIP-FlanT5☆30Jul 29, 2024Updated last year
- ☆29Apr 30, 2024Updated last year
- M-HalDetect Dataset Release☆27Nov 4, 2023Updated 2 years ago
- This is the official implementation of the paper "MM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision…☆32Mar 12, 2024Updated last year
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- [ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specific…☆80Sep 13, 2024Updated last year
- ☆11May 25, 2023Updated 2 years ago
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Oct 14, 2024Updated last year
- ☆11Dec 23, 2024Updated last year
- A large scale camera-taken table detection and recognition dataset.☆149Jul 21, 2025Updated 6 months ago
- Concurrency library☆16Oct 13, 2024Updated last year
- Pre-trained model weights of MAE-Face.☆39Jan 30, 2024Updated 2 years ago
- Logical inference system based on event semantics and degree semantics in formal semantics☆11Jan 22, 2023Updated 3 years ago
- Library implementation of "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"☆40Oct 31, 2024Updated last year