☆17 Jun 12, 2024 Updated last year
Alternatives and similar repositories for pdfvqa
Users who are interested in pdfvqa are comparing it to the libraries listed below.
- [CVPR 2025] GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector ☆16 Mar 19, 2025 Updated last year
- ☆16 Nov 1, 2024 Updated last year
- ☆18 May 30, 2023 Updated 2 years ago
- ☆15 Sep 7, 2022 Updated 3 years ago
- [NAACL 2025] Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart Understanding ☆20 Aug 23, 2025 Updated 7 months ago
- ☆14 Jul 29, 2024 Updated last year
- ☆69 Jan 9, 2024 Updated 2 years ago
- ☆12 Apr 24, 2024 Updated last year
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm… ☆10 May 9, 2024 Updated last year
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon! ☆11 May 24, 2023 Updated 2 years ago
- Coq & Haskell code for Calculating Correct Compilers II ☆12 Feb 22, 2022 Updated 4 years ago
- Official implementation of OpenTab (ICLR2024) ☆13 Mar 27, 2024 Updated last year
- Zero Memory Widget ☆10 Dec 30, 2020 Updated 5 years ago
- [EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation" ☆15 Aug 26, 2025 Updated 6 months ago
- This repository compiles a list of papers/resources related to the graph retrieval-augmented generation! Star⭐ the repo and follow me if … ☆10 Dec 7, 2024 Updated last year
- ☆30 Jan 24, 2025 Updated last year
- Corpus to accompany: "Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding" ☆11 Apr 11, 2025 Updated 11 months ago
- Incremental View Maintenance support for DuckDB ☆16 Oct 24, 2023 Updated 2 years ago
- ☆10 Dec 3, 2021 Updated 4 years ago
- CRNN_CTC_PyTorch ☆10 Oct 17, 2019 Updated 6 years ago
- LLM inference in C/C++ ☆26 Updated this week
- Code for the arxiv paper: Complex Claim Verification with Evidence Retrieved in the Wild ☆13 Nov 27, 2023 Updated 2 years ago
- This repository stores the proposals submitted to the NumFOCUS Small Development Grants (SDG) program. ☆19 Nov 18, 2025 Updated 4 months ago
- A collection of AWESOME language modeling techniques on tabular data applications. ☆32 Oct 14, 2024 Updated last year
- DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems ☆69 Sep 29, 2024 Updated last year
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua… ☆64 May 15, 2025 Updated 10 months ago
- The example of correspondence between fine classes and superclasses (coarse classes) in ImageNet. ☆13 Dec 4, 2024 Updated last year
- ☆21 Apr 2, 2025 Updated 11 months ago
- ☆14 May 26, 2023 Updated 2 years ago
- A library for training crosscoders ☆16 May 28, 2025 Updated 9 months ago
- resnet_cifar10_cifar100_imagenet ☆14 Oct 30, 2018 Updated 7 years ago
- k for BareMetal ☆12 Dec 10, 2024 Updated last year
- ☆13 Aug 26, 2024 Updated last year
- ☆25 Mar 6, 2026 Updated 2 weeks ago
- Sparse Fourier Backpropagation in Cryo-EM Reconstruction ☆12 Dec 3, 2023 Updated 2 years ago
- Visualizing ImageNet Classes Hierarchical Structure. ☆15 Apr 8, 2018 Updated 7 years ago
- A single-line modification to any (dualizer-based) optimizer that allows the optimizer to adapt to the scale of the gradients as they cha… ☆19 Jan 11, 2025 Updated last year
- ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation ☆27 May 27, 2025 Updated 9 months ago
- Create cohorts from databases utilizing the OMOP CDM ☆10 May 19, 2025 Updated 10 months ago