allenai / deepfigures-openLinks
Companion code to the paper "Extracting Scientific Figures with Distantly Supervised Neural Networks" 🤖
☆142Updated 3 years ago
Alternatives and similar repositories for deepfigures-open
Users that are interested in deepfigures-open are comparing it to the libraries listed below
Sorting:
- Command line tool to extract figures, tables, and captions from scholarly documents in PDF form.☆130Updated 7 years ago
- Given a scholarly PDF, extract figures, tables, captions, and section titles.☆682Updated last year
- Science-parse version 2☆248Updated 5 years ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆69Updated 4 years ago
- ☆41Updated 5 years ago
- Science Parse parses scientific papers (in PDF form) and returns them in structured form.☆675Updated last year
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆180Updated 2 years ago
- Neuralized version of the Reference String Parser component of the ParsCit package.☆81Updated 3 years ago
- Repository for NAACL 2019 paper on Citation Intent prediction☆126Updated 5 years ago
- A tool for extracting arbitrary tables from untagged PDF documents☆40Updated 4 years ago
- ☆94Updated 3 years ago
- Python client for GROBID Web services☆364Updated last week
- ☆81Updated 3 years ago
- PDF to XML ALTO file converter☆253Updated 3 weeks ago
- Extracting scientific claims from biomedical abstracts (powered by AllenNLP)☆144Updated 4 years ago
- Framework for information extraction from tables☆41Updated 6 years ago
- multimodal document analysis☆167Updated last year
- Corpus of Open Access articles from multiple fields in Science, Technology, and Medicine.☆74Updated 8 years ago
- Toolbox for OCR post-correction☆121Updated 6 years ago
- Code for ICPR2022 paper: "Graph Neural Networks and Representation Embedding for table extraction in PDF Documents"☆35Updated 2 years ago
- ICDAR 2021 Competition on Scientific Literature Parsing☆35Updated 5 years ago
- GROBID extension for identifying and normalizing physical quantities.☆82Updated 3 months ago
- Apache PDFBox extension for precisely extracting character/symbol locations and identities from born-digital PDF files.☆19Updated 3 weeks ago
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)☆439Updated last year
- Service for converting and enhancing heterogeneous publisher XML formats into TEI☆57Updated last year
- Publicly released code for the LAMBERT model☆103Updated 4 years ago
- A Named-Entity Recogniser based on Grobid.☆54Updated 4 months ago
- Table Extraction Tool☆90Updated 7 years ago
- Functional and structural analysis of tables in research papers (Table disentangling)☆20Updated 8 years ago
- Java command-line tools for comparing results to ground truth for table location and structure detection as used in the ICDAR 2013 Table …☆33Updated 5 years ago