Companion code to the paper "Extracting Scientific Figures with Distantly Supervised Neural Networks" 🤖
☆147Jun 14, 2022Updated 3 years ago
Alternatives and similar repositories for deepfigures-open
Users that are interested in deepfigures-open are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Given a scholarly PDF, extract figures, tables, captions, and section titles.☆734Mar 10, 2024Updated 2 years ago
- Multi-Entity Extraction Framework for Academic Documents (with default extraction tools)☆31Oct 3, 2023Updated 2 years ago
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆14Sep 8, 2022Updated 3 years ago
- Science-parse version 2☆255Nov 20, 2019Updated 6 years ago
- Science Parse parses scientific papers (in PDF form) and returns them in structured form.☆697May 26, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.☆25Mar 17, 2021Updated 5 years ago
- Line Chart Data Extraction: Official code for LineFormer - ICDAR23 Paper☆58Nov 26, 2025Updated 4 months ago
- Companion Website for the research project on Explorable Multiverse Analysis Reports☆13Jun 20, 2023Updated 2 years ago
- Repository for NAACL 2019 paper on Citation Intent prediction☆129Dec 1, 2019Updated 6 years ago
- [VL/HCC 2017] TraceDiff: Debugging Unexpected Code Behavior Using Trace Divergences☆12Sep 2, 2017Updated 8 years ago
- ☆98May 20, 2022Updated 3 years ago
- Experimental Git Mirror of "https://sourceforge.net/p/lemur/galago" using "https://github.com/felipec/git-remote-hg"☆13Dec 17, 2020Updated 5 years ago
- ☆13Jan 14, 2022Updated 4 years ago
- PDF to XML ALTO file converter☆268Feb 11, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Softcite software mention recognizer, finding mentions and citations to software from within the academic literature☆82Sep 30, 2025Updated 5 months ago
- This repository contains the dataset and code for our ACL'23 publication: "MatSci-NLP: Evaluating Scientific Language Models on Materials…☆16Nov 21, 2023Updated 2 years ago
- An open-source CRF Reference String Parsing Package☆161May 6, 2020Updated 5 years ago
- DocBank: A Benchmark Dataset for Document Layout Analysis☆642Aug 12, 2024Updated last year
- ☆14Jan 26, 2021Updated 5 years ago
- Utility to compile string of chemical terms into data structure with chemical formula and composition☆13Sep 17, 2021Updated 4 years ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆69Nov 7, 2020Updated 5 years ago
- A machine learning software for extracting information from scholarly documents☆4,727Updated this week
- Easy trees in LaTeX and TikZ☆14Dec 16, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)☆461Apr 11, 2024Updated last year
- 使用SURF+Kmeans建立的图像检索系统(CBIR)☆13May 8, 2020Updated 5 years ago
- ProPara (Process Paragraph Comprehension) dataset and models☆81Aug 30, 2019Updated 6 years ago
- Create TensorRT-runtime for vietocr☆12Jun 8, 2021Updated 4 years ago
- Open Access PDF harvester, metadata aggregator and full-text ingester☆62May 3, 2024Updated last year
- How do we process data in different formats like docx, pdf etc and generate insights to be linked with structured data in database?This p…☆14May 27, 2020Updated 5 years ago
- Grobid module for superconductor material and properties extraction☆22May 17, 2025Updated 10 months ago
- A BERT model for scientific text.☆1,677Feb 22, 2022Updated 4 years ago
- Python client for GROBID Web services☆394Mar 5, 2026Updated 3 weeks ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)☆22Jun 16, 2021Updated 4 years ago
- Project page for "Cross-Domain Document Object Detection: Benchmark Suite and Method, CVPR 2020"☆46Oct 15, 2020Updated 5 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Mar 4, 2022Updated 4 years ago
- ChemDataExtractor toolkit updated to include semi-supervised quaternary relationship extraction☆13Feb 8, 2021Updated 5 years ago
- ☆13Jul 24, 2024Updated last year
- ☆1,043Jul 9, 2025Updated 8 months ago
- Geometry Normalization Networks for Accurate Scene Text Detection (iccv 2019)☆21Apr 3, 2020Updated 5 years ago