☆18 Jun 12, 2024 Updated last year
Alternatives and similar repositories for pdfvqa
Users that are interested in pdfvqa are comparing it to the libraries listed below.
- [NAACL 2025] Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart Understanding ☆21 Aug 23, 2025 Updated 8 months ago
- ☆70 Jan 9, 2024 Updated 2 years ago
- ☆12 Apr 24, 2024 Updated 2 years ago
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm… ☆10 May 9, 2024 Updated last year
- Coq & Haskell code for Calculating Correct Compilers II ☆12 Feb 22, 2022 Updated 4 years ago
- [EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation" ☆15 Aug 26, 2025 Updated 8 months ago
- This repository compiles a list of papers/resources related to the graph retrieval-augmented generation! Star⭐ the repo and follow me if … ☆10 Dec 7, 2024 Updated last year
- Corpus to accompany: "Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding" ☆11 Apr 11, 2025 Updated last year
- ☆10 Dec 3, 2021 Updated 4 years ago
- Incremental View Maintenance support for DuckDB ☆16 Oct 24, 2023 Updated 2 years ago
- Code for the arxiv paper: Complex Claim Verification with Evidence Retrieved in the Wild ☆13 Nov 27, 2023 Updated 2 years ago
- A collection of AWESOME language modeling techniques on tabular data applications. ☆33 Oct 14, 2024 Updated last year
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua… ☆64 May 15, 2025 Updated 11 months ago
- ☆21 Apr 2, 2025 Updated last year
- ☆14 May 26, 2023 Updated 2 years ago
- DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems ☆72 Sep 29, 2024 Updated last year
- k for BareMetal ☆13 Dec 10, 2024 Updated last year
- ☆13 Aug 26, 2024 Updated last year
- Visualizing ImageNet Classes Hierarchical Structure. ☆15 Apr 8, 2018 Updated 8 years ago
- This Repository provides a Jupyter Notebook for building a small language model from scratch using 'TinyStories' dataset. Covers data pr… ☆38 Jun 7, 2025 Updated 10 months ago
- Integrating neurosymbolic representations into LLMs for interpretability, steering, and running symbolic algorithms ☆14 Feb 2, 2026 Updated 3 months ago
- Official Repo for the Paper "AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution o… ☆26 Jan 12, 2025 Updated last year
- Python CAPTCHA generation tool ☆11 Mar 5, 2022 Updated 4 years ago
- ALS Prolog Compiler & Development Environment ☆18 Feb 21, 2026 Updated 2 months ago
- Yonsei Natural Language Understanding tool ☆12 Dec 7, 2022 Updated 3 years ago
- Learning to route instances for Human vs AI Feedback (ACL Main '25) ☆28 Jul 23, 2025 Updated 9 months ago
- ☆18 Jul 3, 2024 Updated last year
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning ☆20 Jul 7, 2022 Updated 3 years ago
- Optimizing database queries with array programming ☆20 Sep 21, 2020 Updated 5 years ago
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning ☆23 Sep 17, 2024 Updated last year
- User-Mode KolibriOS developer tools ☆28 Jan 22, 2025 Updated last year
- Contrast-guided Feature Adjustment Module for Visual Information Extraction ☆30 May 23, 2023 Updated 2 years ago
- DroidAgent: Intent-Driven Mobile GUI Testing with Autonomous LLM Agents ☆68 Mar 12, 2024 Updated 2 years ago
- ☆29 Feb 24, 2025 Updated last year
- ☆28 Dec 9, 2024 Updated last year
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning ☆95 Jan 7, 2025 Updated last year
- 🦦 Source Code for EMNLP-22 findings paper "Mixed-modality Representation Learning and Pre-training for Joint Table-and-Text Retrieval in… ☆21 May 10, 2023 Updated 2 years ago
- Self-Organizing and Incremental Neural Networks ☆20 Sep 26, 2013 Updated 12 years ago
- Benchmarking Physical Risk Awareness of Foundation Model-based Embodied AI Agents ☆23 Nov 28, 2024 Updated last year