☆46May 21, 2024Updated 2 years ago
Alternatives and similar repositories for tablevqabench
Users that are interested in tablevqabench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023☆110Oct 24, 2023Updated 2 years ago
- Evaluating Multimodal Generative AI with Korean Educational Standards, NAACL 2025.☆26May 15, 2025Updated last year
- [ECCV2024] Official implementation of paper, "DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs".☆155Aug 8, 2024Updated last year
- Official Implementation of SCOB [ICCV 2023]☆23Nov 16, 2023Updated 2 years ago
- Official PyTorch implementation of Extract Free Dense Misalignment from CLIP (AAAI'25)☆25Apr 20, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023☆46Jun 11, 2024Updated 2 years ago
- Weakly opinionated library for implementing ML models. Less boilerplate, More rigor☆21Jul 1, 2022Updated 3 years ago
- [ECCV2022] DProST: Dynamic Projective Spatial Transformer Network for 6D Pose Estimation☆24Jul 19, 2024Updated last year
- Reduction of Video Compression Artifacts Based on Deep Temporal Networks (IEEE Access, 2018)☆56Apr 14, 2023Updated 3 years ago
- read 1 paper everyday (only weekday)☆56Sep 30, 2021Updated 4 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Jul 27, 2021Updated 4 years ago
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆153Apr 22, 2025Updated last year
- Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021☆577Jun 14, 2024Updated 2 years ago
- On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, …☆20Mar 13, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official repository for 'Risk of Bias in Chest Radiography Deep Learning Foundation Models'☆12Sep 27, 2023Updated 2 years ago
- This is the code repo for our paper "Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression".☆13Feb 27, 2024Updated 2 years ago
- [CVPR 2024 Best paper award candidate] EGTR: Extracting Graph from Transformer for Scene Graph Generation☆148Jun 25, 2024Updated last year
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)☆57Mar 31, 2025Updated last year
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆227Jun 12, 2025Updated last year
- My collection of machine learning papers☆299Aug 10, 2023Updated 2 years ago
- [ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specific…☆86Sep 13, 2024Updated last year
- Rider Reinforcement Learning Environment with Proximal Policy Optimization☆14Sep 5, 2019Updated 6 years ago
- ☆26Dec 29, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Parse LaTeX math expressions☆31Oct 22, 2024Updated last year
- ☆14May 23, 2022Updated 4 years ago
- Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator☆13Apr 28, 2024Updated 2 years ago
- HIPPO: Enhancing the Table Understanding Capability of Large Language Models through Hybrid-Modal Preference Optimization☆18May 29, 2025Updated last year
- This is a detailed code demo on how to conduct Full-Param Supervised Fine-tuning (SFT) and DPO (Direct Preference Optimization)☆19Jan 9, 2025Updated last year
- Data and code for ACL 2022 paper "MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data"☆54Oct 22, 2024Updated last year
- Study for Instant neural graphics primitives (Unofficial)☆11Jan 18, 2022Updated 4 years ago
- Official PyTorch Implementation of "Rethinking HTG Evaluation: Bridging Generation and Recognition" (Oral) - 1st Workshop on Critical Eva…☆17Sep 23, 2024Updated last year
- This is the paddle code for SeBoW(Self-Born wiring for neural trees), a kind of neural tree born form a large search space☆11Dec 10, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.☆32Feb 26, 2025Updated last year
- Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes☆547Jul 20, 2025Updated 10 months ago
- The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"☆23Dec 21, 2023Updated 2 years ago
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning☆95Jan 7, 2025Updated last year
- ☆19Jun 11, 2024Updated 2 years ago
- A collection of particularly difficult test scenarios for evaluating browser-use.☆27May 15, 2026Updated last month
- ☆19Mar 28, 2022Updated 4 years ago