multimodal document analysis
☆165Feb 28, 2026Updated 3 weeks ago
Alternatives and similar repositories for mmda
Users that are interested in mmda are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆180Mar 18, 2023Updated 3 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆35May 24, 2024Updated last year
- S2APLER: S2 Agglomeration of Papers with Low Error Rate (it's for academic paper clustering)☆21Nov 4, 2025Updated 4 months ago
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆288Feb 13, 2023Updated 3 years ago
- Software that makes labeling PDFs easy.☆427May 13, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Benchmark dataset for the evaluation of scientific article representations on the task of citation recommendation across various scientif…☆12Oct 21, 2022Updated 3 years ago
- Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.☆18Apr 23, 2023Updated 2 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…☆362Oct 31, 2022Updated 3 years ago
- ☆34Jan 2, 2024Updated 2 years ago
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)☆22Jun 16, 2021Updated 4 years ago
- Index of URLs to pdf files all over the internet and scripts☆25May 2, 2023Updated 2 years ago
- library supporting NLP and CV research on scientific papers☆791Nov 8, 2024Updated last year
- DocBank: A Benchmark Dataset for Document Layout Analysis☆642Aug 12, 2024Updated last year
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)☆461Apr 11, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents