A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF documents, especially from scientific articles.
☆73 · Nov 7, 2020 · Updated 5 years ago
Alternatives and similar repositories for pdf-text-extraction-benchmark
Users that are interested in pdf-text-extraction-benchmark are comparing it to the libraries listed below.
- my take at a PDF text extraction utility ☆26 · Jun 15, 2015 · Updated 11 years ago
- AI Assistance for Writing Scientific Alt Text ☆14 · Feb 7, 2024 · Updated 2 years ago
- The repository of Icecite, a research paper management system. ☆15 · Mar 29, 2018 · Updated 8 years ago
- A basic tool that extracts the structure from the PDF files of scientific articles. ☆77 · Jan 4, 2022 · Updated 4 years ago
- A Knowledge Base for research software relying on large-scale text mining and curated knowledge sources ☆18 · May 14, 2023 · Updated 3 years ago
- PDF Extraction Toolkit ☆43 · Nov 23, 2020 · Updated 5 years ago
- Multi-Entity Extraction Framework for Academic Documents (with default extraction tools) ☆31 · Oct 3, 2023 · Updated 2 years ago
- Material parsers and other tools and scripts, initially developed for Grobid Superconductor ☆14 · Feb 21, 2025 · Updated last year
- Packaging Metadata Comparisons ☆18 · Apr 3, 2020 · Updated 6 years ago
- Softcite software mention recognizer, finding mentions and citations to software from within the academic literature