Command line tool to extract figures, tables, and captions from scholarly documents in PDF form.
β130Apr 9, 2018Updated 8 years ago
Alternatives and similar repositories for pdffigures
Users that are interested in pdffigures are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Given a scholarly PDF, extract figures, tables, captions, and section titles.β749Mar 10, 2024Updated 2 years ago
- Companion code to the paper "Extracting Scientific Figures with Distantly Supervised Neural Networks" π€β148Jun 14, 2022Updated 4 years ago
- Multi-Entity Extraction Framework for Academic Documents (with default extraction tools)β31Oct 3, 2023Updated 2 years ago
- β41May 15, 2020Updated 6 years ago
- A place to collect and share knowledge about liberating data from PDFsβ55Jan 30, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Training data generator for text detectionβ38Jul 16, 2020Updated 5 years ago
- PDF Extraction Toolkitβ43Nov 23, 2020Updated 5 years ago
- table understanding dataset for comparative evaluation of different table understanding algorithmsβ13Jun 15, 2018Updated 8 years ago
- Examples or utilizing Microsoft Academic for conducting covid-19 researchβ23Dec 26, 2022Updated 3 years ago
- Science Parse parses scientific papers (in PDF form) and returns them in structured form.β700May 26, 2024Updated 2 years ago
- Content ExtRactor and MINErβ512Jun 30, 2022Updated 4 years ago
- Supervised learning of morphologyβ28Jan 17, 2017Updated 9 years ago
- Models, scripts, and data sets for data annotation (aka coding, aka rating)β12Mar 9, 2015Updated 11 years ago
- An open-source CRF Reference String Parsing Packageβ161May 6, 2020Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Various attempts at scanning aerial imagery to detect baseball diamonds.β17Jul 13, 2014Updated 11 years ago
- A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.β2,256Jun 24, 2022Updated 4 years ago
- β14Mar 14, 2024Updated 2 years ago
- Code for the paper: "Mining Algorithm Roadmap in Scientific Publications" - KDD 2019β23Jul 22, 2023Updated 2 years ago
- Read FASTA files indexed with .fai indexes. Also supports BGZIP+.gziβ12May 19, 2026Updated last month
- β19Dec 19, 2018Updated 7 years ago
- Configuration Space Exploration Frameworkβ16Oct 13, 2020Updated 5 years ago
- Data loader for the Apache Arrow format.β65Apr 2, 2026Updated 3 months ago
- Agglomerative hierarchical clustering in JavaScriptβ19Dec 17, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- β24Mar 3, 2024Updated 2 years ago
- Prototype your Jupyter Widget in the browser with anywidget and JupyterLite π‘β17Apr 7, 2025Updated last year
- cross platform multiuser network painting program using Qt, boost asioβ41Oct 15, 2012Updated 13 years ago
- Interactive visual analytic tool for exploring epigenomics data w/ associated metadata, powered by HiGlass and Goslingβ13Nov 10, 2023Updated 2 years ago
- This sample .Net application shows you how to use the .Net SDK to read and write files to Azure Data Lake Store, and do other filesystem β¦β10Oct 18, 2023Updated 2 years ago
- Custom HTML elements to reuse Wikidataβ14Jan 6, 2023Updated 3 years ago
- Core UI Module for After the Deadlineβ20Mar 5, 2022Updated 4 years ago
- this repository contains the dataset and the source code for the EMNLP 2019 paper "A Neural Citation Count Prediction Model based on Peerβ¦β10Oct 8, 2021Updated 4 years ago
- 2D Hilbert curve mapping in JavaScriptβ16Jan 4, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- β18Nov 13, 2024Updated last year
- A modular JavaScript library to create PDFsβ11Mar 5, 2021Updated 5 years ago
- Digital Library of the Middle East web application, based on Spotlightβ21Updated this week
- Extract tables from PDF filesβ358May 17, 2016Updated 10 years ago
- β16Dec 6, 2014Updated 11 years ago
- Full dataset of Reuters composed of 8,551,441 news titles, links and timestamps (Jan 2007 - Aug 2016).β22Aug 17, 2016Updated 9 years ago
- Allow anyone with a modern browser to stream a 1GB, 10GB, 100GB, or 1TB file over the Internet and into a happy home.β32Oct 7, 2018Updated 7 years ago