multimodal document analysis
☆166May 14, 2026Updated last week
Alternatives and similar repositories for mmda
Users that are interested in mmda are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆180Mar 18, 2023Updated 3 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆35May 24, 2024Updated 2 years ago
- S2APLER: S2 Agglomeration of Papers with Low Error Rate (it's for academic paper clustering)☆21May 15, 2026Updated last week
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆288Feb 13, 2023Updated 3 years ago
- Software that makes labeling PDFs easy.☆430May 13, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Benchmark dataset for the evaluation of scientific article representations on the task of citation recommendation across various scientif…☆12Oct 21, 2022Updated 3 years ago
- Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.☆18Apr 23, 2023Updated 3 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…☆365Oct 31, 2022Updated 3 years ago
- ☆34Jan 2, 2024Updated 2 years ago
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)☆22Jun 16, 2021Updated 4 years ago
- Index of URLs to pdf files all over the internet and scripts☆25May 2, 2023Updated 3 years ago
- library supporting NLP and CV research on scientific papers☆795Nov 8, 2024Updated last year
- DocBank: A Benchmark Dataset for Document Layout Analysis☆645Aug 12, 2024Updated last year
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)☆466Apr 11, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents☆31Dec 8, 2022Updated 3 years ago
- A Unified Toolkit for Deep Learning Based Document Image Analysis☆5,735Aug 15, 2024Updated last year
- ☆18Oct 22, 2022Updated 3 years ago
- Japanese / English Bilingual LLM☆30Dec 23, 2025Updated 5 months ago
- ☆1,046Jul 9, 2025Updated 10 months ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Mar 4, 2022Updated 4 years ago
- ☆61Aug 18, 2021Updated 4 years ago
- ☆482Jul 8, 2025Updated 10 months ago
- Code for Analyzing Redundancy in Pretrained Transformer Models accepted at EMNLP 2020☆14Oct 6, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- pytorch版基于gpt+nezha的中文多轮Cdial☆11Oct 22, 2022Updated 3 years ago
- Tool to parse wiki tables from the HTML dump of Wikipedia☆11Jun 12, 2022Updated 3 years ago
- API client for fetching and comparing passages from legislation☆14Jan 26, 2025Updated last year
- S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/☆1,057Apr 26, 2024Updated 2 years ago
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …☆33Jun 24, 2023Updated 2 years ago
- A curated list of resources for Document Understanding (DU) topic☆1,514Jun 2, 2023Updated 2 years ago
- Data/Code Repository for https://api.semanticscholar.org/CorpusID:218470122☆140Jul 25, 2024Updated last year
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 3 years ago
- Algorithms, papers, datasets, performance comparisons for Document AI.☆208Mar 1, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆15Jun 16, 2021Updated 4 years ago
- SPECTER: Document-level Representation Learning using Citation-informed Transformers☆579Jun 12, 2023Updated 2 years ago
- A machine learning tool for fishing entities☆268Feb 27, 2026Updated 2 months ago
- ☆14Aug 3, 2022Updated 3 years ago
- Unifew: Unified Fewshot Learning Model☆18Sep 10, 2021Updated 4 years ago
- ↔️ Utilizing RBERT model structure for KLUE Relation Extraction task☆15Nov 15, 2022Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆60May 11, 2023Updated 3 years ago