Command line tool to extract figures, tables, and captions from scholarly documents in PDF form.
β130Apr 9, 2018Updated 8 years ago
Alternatives and similar repositories for pdffigures
Users that are interested in pdffigures are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Given a scholarly PDF, extract figures, tables, captions, and section titles.β744Mar 10, 2024Updated 2 years ago
- Companion code to the paper "Extracting Scientific Figures with Distantly Supervised Neural Networks" π€β148Jun 14, 2022Updated 3 years ago
- Multi-Entity Extraction Framework for Academic Documents (with default extraction tools)β31Oct 3, 2023Updated 2 years ago
- Compile optimized Vega and Vega-Lite bundles.β18Apr 13, 2021Updated 5 years ago
- Edit and explore Mosaic visualizations (real-time interaction for big datasets) in your browserβ21Oct 31, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Context-Aware, Recommender-Powered Visualization Authoringβ22Jul 22, 2020Updated 5 years ago
- A place to collect and share knowledge about liberating data from PDFsβ56Jan 30, 2022Updated 4 years ago
- Automatically interact with SVG charts.β19Sep 23, 2025Updated 7 months ago
- REV: Reverse-Engineering Visualizationsβ61Jun 3, 2019Updated 6 years ago
- Training data generator for text detectionβ38Jul 16, 2020Updated 5 years ago
- table understanding dataset for comparative evaluation of different table understanding algorithmsβ13Jun 15, 2018Updated 7 years ago
- Content ExtRactor and MINErβ512Jun 30, 2022Updated 3 years ago
- An open-source CRF Reference String Parsing Packageβ161May 6, 2020Updated 6 years ago
- Library with user interface elements and client-server communication classes based on Google Web Toolkit (GWT) that can be used for crowdβ¦β14Oct 3, 2017Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Various attempts at scanning aerial imagery to detect baseball diamonds.β17Jul 13, 2014Updated 11 years ago
- Science-parse version 2β257Nov 20, 2019Updated 6 years ago
- A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.β2,256Jun 24, 2022Updated 3 years ago
- Some ideas on making Bags into Git repositoriesβ16Dec 23, 2014Updated 11 years ago
- β14Mar 14, 2024Updated 2 years ago
- β19Dec 19, 2018Updated 7 years ago
- Configuration Space Exploration Frameworkβ17Oct 13, 2020Updated 5 years ago
- DFKI Layout Detection for OCR-Dβ47May 1, 2025Updated last year
- β11Jan 10, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- β21Dec 5, 2016Updated 9 years ago
- AASC: ACL Anthology Sentence Corpusβ20Oct 28, 2020Updated 5 years ago
- β24Mar 3, 2024Updated 2 years ago
- Interactive visual analytic tool for exploring epigenomics data w/ associated metadata, powered by HiGlass and Goslingβ13Nov 10, 2023Updated 2 years ago
- Custom HTML elements to reuse Wikidataβ14Jan 6, 2023Updated 3 years ago
- Uses GloVe embeddings and greedy sequence segmentation to semantically segment a text document into any number of k segments.β33Feb 17, 2019Updated 7 years ago
- Core UI Module for After the Deadlineβ20Mar 5, 2022Updated 4 years ago
- An Apache Arrow-backed file format for pre-projected, pre-triangulated maps, including dot density algorithms and regl visualization.β18Feb 10, 2023Updated 3 years ago
- An implementation of DTW for spoken term detection. Including non-constrained, segmental DTW, slope-constrained versions. For more detailβ¦β15Jun 2, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The Open Spectral Database - Its all about the dataβ18Aug 7, 2024Updated last year
- 2D Hilbert curve mapping in JavaScriptβ16Jan 4, 2023Updated 3 years ago
- View component for Vega visualizations.β21Dec 20, 2018Updated 7 years ago
- Phylogenetic Application written in OCaml and Cβ19Jan 29, 2020Updated 6 years ago
- A modular JavaScript library to create PDFsβ11Mar 5, 2021Updated 5 years ago
- Tools for working with book dataβ20Nov 25, 2025Updated 5 months ago
- Digital Library of the Middle East web application, based on Spotlightβ21Updated this week