A basic tool that extracts the structure from the PDF files of scientific articles.
☆76 · Jan 4, 2022 · Updated 4 years ago
Alternatives and similar repositories for pdfact
Users that are interested in pdfact are comparing it to the libraries listed below.
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF … ☆71 · Nov 7, 2020 · Updated 5 years ago
- Named Entity Disambiguation and Linking ☆16 · May 24, 2024 · Updated last year
- my take at a PDF text extraction utility ☆25 · Jun 15, 2015 · Updated 10 years ago
- table understanding dataset for comparative evaluation of different table understanding algorithms ☆13 · Jun 15, 2018 · Updated 7 years ago
- A Test Collection for Evaluating Retrieval of Studies for Inclusion in Systematic Reviews ☆12 · Sep 22, 2023 · Updated 2 years ago
- The qlever command-line tool. With this you can control (almost) everything QLever can do ☆68 · Updated this week
- Keyphrase Extraction Prototypes ☆15 · Nov 24, 2016 · Updated 9 years ago
- LA-PDFText is a system for extracting accurate text from PDF-based research articles (and an interface to be able to improve performance … ☆82 · Mar 2, 2018 · Updated 8 years ago
- Ogee Arches is a package designed for the Arches platform that implements the Linked.art data model, provides a complete vocabulary to su… ☆15 · Feb 4, 2026 · Updated 3 months ago
- Tokenize and clean strings in Python ☆11 · Jan 11, 2018 · Updated 8 years ago
- XSLT application to generate MARCXML from BIBFRAME RDF/XML ☆19 · May 7, 2026 · Updated last week
- Entity linking evaluation and analysis tool ☆26 · Apr 25, 2026 · Updated 3 weeks ago
- Shared XSLT Files ☆30 · May 11, 2021 · Updated 5 years ago
- Spell checker using Brill and Moore's noisy channel error model ☆12 · Jan 9, 2019 · Updated 7 years ago
- Multi-Entity Extraction Framework for Academic Documents (with default extraction tools) ☆31 · Oct 3, 2023 · Updated 2 years ago
- Batch scripts curating BioRxiv and PubMed articles by using Altmetric score. ☆11 · May 9, 2020 · Updated 6 years ago
- A tool for correcting misspellings in textual input using the Noisy Channel Model. ☆11 · Sep 26, 2020 · Updated 5 years ago
- A command-line DSL budget manager ☆13 · Oct 25, 2022 · Updated 3 years ago
- The OpenCitations RDF Resource Browser ☆15 · Oct 29, 2025 · Updated 6 months ago
- ☆24 · Oct 19, 2023 · Updated 2 years ago
- Science Parse parses scientific papers (in PDF form) and returns them in structured form. ☆699 · May 26, 2024 · Updated last year
- Convert ALTO XML to plain text + minimal metadata ☆17 · Oct 17, 2024 · Updated last year
- ☆20 · Jul 22, 2021 · Updated 4 years ago
- SPARQLGraph - Visual Query Builder for Biological RDF Databases ☆16 · Oct 15, 2015 · Updated 10 years ago
- Recommendation engine for scholarly articles ☆12 · Oct 22, 2019 · Updated 6 years ago
- demonstrating shunting yard algorithm and evaluation of arithmetic expressions ☆15 · Jan 11, 2025 · Updated last year
- Mirror of the official development repository of PHAIDRA. We monitor our public github repo, so contributions via issues & pull requests… ☆22 · May 11, 2026 · Updated last week
- Sources and Documentation for the HINT project ☆13 · May 12, 2026 · Updated last week
- Analyze XML extracted from PDFs (e.g. from TET or PDFMiner) ☆20 · Jan 11, 2018 · Updated 8 years ago
- liberate all kinds of data from PDF and other unstructural format and make the information machine-readable and visualizeable for popul… ☆31 · Jun 1, 2018 · Updated 7 years ago
- Basic RDF Datatypes ☆15 · Feb 23, 2026 · Updated 2 months ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification ☆180 · Mar 18, 2023 · Updated 3 years ago
- Text pattern search using marisa-trie ☆19 · Jan 26, 2025 · Updated last year
- Readme2: An R Package for Improved Automated Nonparametric Content Analysis for Social Science ☆51 · May 11, 2026 · Updated last week
- Keyphrase Generation for Scientific Document Retrieval ☆11 · Oct 2, 2020 · Updated 5 years ago
- hnsw implemented by python ☆22 · Nov 28, 2019 · Updated 6 years ago
- A generator for synthetic streams of financial transactions. ☆11 · Sep 20, 2022 · Updated 3 years ago
- INCLUSIFY is a tool to support the practical use of diversity-sensitive language in German. ☆12 · Sep 14, 2022 · Updated 3 years ago
- ELOT Literate Ontology Tool ☆27 · Updated this week