allenai / deepfigures-open
Companion code to the paper "Extracting Scientific Figures with Distantly Supervised Neural Networks" 🤖
☆138 · Updated 2 years ago
Alternatives and similar repositories for deepfigures-open:
Users interested in deepfigures-open are comparing it to the repositories listed below.
- Command line tool to extract figures, tables, and captions from scholarly documents in PDF form. ☆130 · Updated 6 years ago
- ☆40 · Updated 4 years ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF … ☆66 · Updated 4 years ago
- Science-parse version 2 ☆235 · Updated 5 years ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification ☆175 · Updated last year
- Neuralized version of the Reference String Parser component of the ParsCit package. ☆80 · Updated 2 years ago
- Science Parse parses scientific papers (in PDF form) and returns them in structured form. ☆644 · Updated 8 months ago
- Python client for GROBID Web services ☆308 · Updated 3 weeks ago
- Java command-line tools for comparing results to ground truth for table location and structure detection as used in the ICDAR 2013 Table … ☆33 · Updated 4 years ago
- ICDAR 2021 Competition on Scientific Literature Parsing ☆34 · Updated 4 years ago
- Framework for information extraction from tables ☆41 · Updated 5 years ago
- Detectron2 for Document Layout Analysis ☆185 · Updated 6 months ago
- Table structure recognition dataset of the paper: Complicated Table Structure Recognition ☆357 · Updated 4 years ago
- ☆40 · Updated 6 years ago
- Service for converting and enhancing heterogeneous publisher XML formats into TEI ☆50 · Updated 5 months ago
- ☆77 · Updated 2 years ago
- Repository for NAACL 2019 paper on Citation Intent prediction ☆116 · Updated 5 years ago
- Toolbox for OCR post-correction ☆122 · Updated 5 years ago
- ☆87 · Updated 5 years ago
- A tool for extracting arbitrary tables from untagged PDF documents ☆38 · Updated 4 years ago
- The ICDAR 2019 cTDaR is to evaluate the performance of methods for table detection (TRACK A) and table recognition (TRACK B). For the fir… ☆173 · Updated 2 years ago
- Apache PDFBox extension for precisely extracting character/symbol locations and identities from born-digital PDF files. ☆19 · Updated 3 years ago
- Multimodal document analysis ☆162 · Updated 8 months ago
- Page to PAGE Layout Analysis Tool ☆191 · Updated 3 years ago
- ☆92 · Updated 2 years ago
- Table Extraction Tool ☆90 · Updated 6 years ago
- Extracting scientific claims from biomedical abstracts (powered by AllenNLP) ☆141 · Updated 3 years ago
- Dataset accompanying the SPECTER model ☆130 · Updated 2 years ago
- Publicly released code for the LAMBERT model ☆101 · Updated 3 years ago
- ☆129 · Updated last year