A dataset of region-annotated scientific articles.
☆21 · Jan 24, 2020 · Updated 6 years ago
Alternatives and similar repositories for article-regions
Users that are interested in article-regions are comparing it to the libraries listed below.
- ☆25 · Apr 18, 2020 · Updated 6 years ago
- TensorFlow implementation of a segmentation system for document images. ☆35 · Sep 9, 2018 · Updated 7 years ago
- Library with user interface elements and client-server communication classes based on Google Web Toolkit (GWT) that can be used for crowd… ☆14 · Oct 3, 2017 · Updated 8 years ago
- TDF-ICDAR 2019 Dataset for Typeset Math Formula Detection ☆69 · Feb 9, 2020 · Updated 6 years ago
- Convert PubLayNet data into METS/PAGE-XML ☆10 · Mar 17, 2020 · Updated 6 years ago
- DocBank: A Benchmark Dataset for Document Layout Analysis ☆646 · Aug 12, 2024 · Updated last year
- PyTorch implementation for "Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter". ☆67 · Jun 15, 2021 · Updated 5 years ago
- GTDB dataset for training and evaluation of mathematical OCR systems ☆29 · Apr 9, 2021 · Updated 5 years ago
- Unofficial code for augment-XY-CUT in XYLayoutLM ☆30 · Jul 12, 2022 · Updated 3 years ago
- Robust End-to-End Offline Chinese Handwriting Text Page Spotter with Text Kernel ☆37 · Jul 30, 2021 · Updated 4 years ago
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea… ☆13 · Jan 30, 2020 · Updated 6 years ago
- ☆1,047 · Jul 9, 2025 · Updated 11 months ago
- Document Layout Analysis resources repos for development with PdfPig. ☆635 · Oct 1, 2023 · Updated 2 years ago
- JSON Schema format for storing datasets details, documents processed contents, and documents annotations in the document understanding do… ☆14 · Nov 5, 2024 · Updated last year
- ☆10 · Oct 1, 2020 · Updated 5 years ago
- Multi-span Style Extraction for Generative Reading Comprehension ☆10 · Apr 2, 2021 · Updated 5 years ago
- A network for irregular text recognition. ☆26 · Dec 11, 2020 · Updated 5 years ago
- ICDAR 2019: MaskRCNN on PubLayNet datasets. Paragraph detection, table detection, figure detection,... ☆183 · May 11, 2021 · Updated 5 years ago
- Official repository of the paper MPMQA: Multimodal Question Answering on Product Manuals (AAAI 2023) ☆21 · Nov 28, 2022 · Updated 3 years ago
- AI_DocumentLayoutAnalysis ☆39 · Nov 25, 2020 · Updated 5 years ago
- Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs ☆18 · Mar 2, 2020 · Updated 6 years ago
- ☆17 · May 24, 2023 · Updated 3 years ago
- ☆15 · May 26, 2021 · Updated 5 years ago
- The official implementation of CTRNet++. ☆15 · Dec 30, 2024 · Updated last year
- Repository for "LLM-Aligned Geographic Item Tokenization for Local-Life Recommendation". ☆18 · Nov 18, 2025 · Updated 6 months ago
- Reproduction of the DFT method without using Verl. https://arxiv.org/abs/2508.05629 ☆23 · Oct 14, 2025 · Updated 8 months ago
- https://www.nlp.ecei.tohoku.ac.jp/projects/aio/ ☆16 · Aug 4, 2022 · Updated 3 years ago
- Page to PAGE Layout Analysis Tool ☆192 · Jan 17, 2022 · Updated 4 years ago
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models ☆11 · Jan 29, 2019 · Updated 7 years ago
- Swagger (OpenAPI) helper and code generator for Julia ☆38 · Dec 9, 2025 · Updated 6 months ago
- A large-scale infographics dataset from Visual.ly with metadata and additional crowdsourced annotations ☆16 · Oct 8, 2018 · Updated 7 years ago
- A high-performance CUDA port of the CBOW model of word2vec ☆17 · Sep 14, 2014 · Updated 11 years ago
- DatasetImgLabeler is an image annotation tool for researchers to prepare datasets in ICDAR2015 format ☆12 · Dec 7, 2019 · Updated 6 years ago
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning… ☆22 · Nov 2, 2021 · Updated 4 years ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022 ☆13 · Nov 25, 2022 · Updated 3 years ago
- Tool to parse wiki tables from the HTML dump of Wikipedia ☆11 · Jun 12, 2022 · Updated 4 years ago
- Web-based page layout editor created for EMOP (Early Modern OCR Project). ☆11 · May 21, 2021 · Updated 5 years ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models. ☆10 · May 16, 2024 · Updated 2 years ago
- Code for "HAM: Hidden Anchor Mechanism for Scene Text Detection" ☆11 · Sep 22, 2020 · Updated 5 years ago