Command line tool to extract figures, tables, and captions from scholarly documents in PDF form.
☆130Apr 9, 2018Updated 8 years ago
Alternatives and similar repositories for pdffigures
Users that are interested in pdffigures are comparing it to the libraries listed below.
- Given a scholarly PDF, extract figures, tables, captions, and section titles.☆740Mar 10, 2024Updated 2 years ago
- Multi-Entity Extraction Framework for Academic Documents (with default extraction tools)☆31Oct 3, 2023Updated 2 years ago
- Compile optimized Vega and Vega-Lite bundles.☆18Apr 13, 2021Updated 5 years ago
- Edit and explore Mosaic visualizations (real-time interaction for big datasets) in your browser☆22Oct 31, 2025Updated 5 months ago
- Context-Aware, Recommender-Powered Visualization Authoring☆22Jul 22, 2020Updated 5 years ago
- Automatically interact with SVG charts.☆19Sep 23, 2025Updated 6 months ago
- REV: Reverse-Engineering Visualizations☆61Jun 3, 2019Updated 6 years ago
- Probabilistic data structures for large or streaming data sets.☆20May 13, 2017Updated 8 years ago
- Training data generator for text detection☆38Jul 16, 2020Updated 5 years ago
- Examples of utilizing Microsoft Academic for conducting COVID-19 research☆23Dec 26, 2022Updated 3 years ago
- A slim, non-SWIG Python adapter to CTesseract (Tesseract OCR for C).☆24Apr 25, 2014Updated 11 years ago
- Science Parse parses scientific papers (in PDF form) and returns them in structured form.☆698May 26, 2024Updated last year
- Content ExtRactor and MINEr☆511Jun 30, 2022Updated 3 years ago
- Tools for Natural Language Text aware PDF structure analysis☆15Mar 11, 2022Updated 4 years ago
- A Living Papers article starter template.☆25Nov 3, 2023Updated 2 years ago
- An open-source CRF Reference String Parsing Package☆161May 6, 2020Updated 5 years ago
- Various attempts at scanning aerial imagery to detect baseball diamonds.☆17Jul 13, 2014Updated 11 years ago
- ☆22Jun 2, 2017Updated 8 years ago
- A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.☆2,257Jun 24, 2022Updated 3 years ago
- Some ideas on making Bags into Git repositories☆16Dec 23, 2014Updated 11 years ago
- ☆14Mar 14, 2024Updated 2 years ago
- ☆19Dec 19, 2018Updated 7 years ago
- DFKI Layout Detection for OCR-D☆47May 1, 2025Updated 11 months ago
- Common UI Library that powers Polestar and Voyager☆13Jan 15, 2017Updated 9 years ago
- Data loader for the Apache Arrow format.☆63Apr 2, 2026Updated 2 weeks ago
- ☆12Apr 24, 2017Updated 8 years ago
- AASC: ACL Anthology Sentence Corpus☆20Oct 28, 2020Updated 5 years ago
- Agglomerative hierarchical clustering in JavaScript☆19Dec 17, 2024Updated last year
- Quick start for MicroFlo on Arduino - clone and go!☆15Dec 31, 2017Updated 8 years ago
- This sample .NET application shows you how to use the .NET SDK to read and write files to Azure Data Lake Store, and do other filesystem …☆10Oct 18, 2023Updated 2 years ago
- Custom HTML elements to reuse Wikidata☆14Jan 6, 2023Updated 3 years ago
- Deep learning based page layout analysis☆197Apr 24, 2019Updated 6 years ago
- Code for "iSeqL: Interactive Sequence Learning" to be presented at ACM IUI'20☆11Jan 4, 2023Updated 3 years ago
- NLP for evidence-based medicine. https://www.robotreviewer.net/.☆18Jan 16, 2023Updated 3 years ago
- An Apache Arrow-backed file format for pre-projected, pre-triangulated maps, including dot density algorithms and regl visualization.☆18Feb 10, 2023Updated 3 years ago
- An implementation of DTW for spoken term detection. Including non-constrained, segmental DTW, slope-constrained versions. For more detail…☆15Jun 2, 2019Updated 6 years ago
- This repository contains the dataset and the source code for the EMNLP 2019 paper "A Neural Citation Count Prediction Model based on Peer…☆10Oct 8, 2021Updated 4 years ago
- Pluggable DSL that uses pipes to perform a series of linear transformations to extract data☆16Jul 9, 2024Updated last year
- A collection of Jupyter notebooks.☆11Oct 31, 2017Updated 8 years ago