A basic tool that extracts the structure from the PDF files of scientific articles.
☆76Jan 4, 2022Updated 4 years ago
Alternatives and similar repositories for pdfact
Users that are interested in pdfact are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆69Nov 7, 2020Updated 5 years ago
- A Test Collection for Evaluating Retrieval of Studies for Inclusion in Systematic Reviews☆12Sep 22, 2023Updated 2 years ago
- Scripts for file processing and analysis in phylogenetics and phylogeography☆13Jan 6, 2021Updated 5 years ago
- Structured Data from PDF image-based files☆91Mar 1, 2013Updated 13 years ago
- The qlever command-line tool. With this you can control (almost) everything QLever can do☆67Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Fork of https://gitlab.com/autokent/pdf-parse☆13May 27, 2025Updated 10 months ago
- Simulated user for TREC 2016-2017 Dynamic Domain track☆10Dec 27, 2017Updated 8 years ago
- SIGIR'20: An Analysis of BERT in Document Ranking☆21Jul 27, 2020Updated 5 years ago
- Systematic Review Query Visualisation and Understanding Interface☆17Dec 5, 2025Updated 3 months ago
- Natural Language to SQL Queries in the OMOP CDM Datasets☆11Jun 12, 2023Updated 2 years ago
- Ogee Arches is a package designed for the Arches platform that implements the Linked.art data model, provides a complete vocabulary to su…☆15Feb 4, 2026Updated last month
- A distributed stream querying engine that provides sub-millisecond stateful query at millions of queries per-second over fast-evolving li…☆10Jul 18, 2018Updated 7 years ago
- A toolkit for asynchronously validating dense retriever checkpoints during training.☆27Aug 10, 2023Updated 2 years ago
- R library for simulation of PKPD models [DEPRECATED, see InsightRX/PKPDsim]☆21Aug 16, 2017Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Spell checker using Brill and Moore's noisy channel error model☆12Jan 9, 2019Updated 7 years ago
- Blacklight IIIF Content Search plugin☆13Mar 17, 2026Updated last week
- An R package to write Datalog queries and interact with a Datomic database☆11Aug 12, 2021Updated 4 years ago
- Batch scripts curating BioRxiv and PubMed articles by using Altmetric score.☆11May 9, 2020Updated 5 years ago
- Non-official Fuseki Docker image with GeoSPARQL support☆12Mar 17, 2026Updated last week
- A tool for correcting misspellings in textual input using the Noisy Channel Model.☆11Sep 26, 2020Updated 5 years ago
- ☆16Oct 20, 2025Updated 5 months ago
- The OpenCitations RDF Resource Browser☆15Oct 29, 2025Updated 4 months ago
- Computer Vision Segmentation for Document Layout Analysis☆10Sep 26, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Science Parse parses scientific papers (in PDF form) and returns them in structured form.☆697May 26, 2024Updated last year
- Convert ALTO XML to plain text + minimal metadata☆17Oct 17, 2024Updated last year
- ☆20Jul 22, 2021Updated 4 years ago
- A python implementation of "Risk of non Adaptedness" method (with a bit of R too!)☆10Aug 2, 2021Updated 4 years ago
- Tree reconstruction of ancestry using incomplete lineage sorting☆11Jan 9, 2026Updated 2 months ago
- Recommendation engine for scholarly articles☆12Oct 22, 2019Updated 6 years ago
- Detects and blacklists paralog RAD loci analyzed in Stacks or ipyrad, based on the McKinney 2017 method (doi:10.1111/1755-0998.12613)☆10Sep 4, 2019Updated 6 years ago
- Mirror of the official development repository of PHAIDRA. We monitor our public github repo, so contributions via issues & pull requests…☆22Mar 20, 2026Updated last week
- Analyze XML extracted from PDFs (e.g. from TET or PDFMiner)☆20Jan 11, 2018Updated 8 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- TeXoo – A Zoo of Text Extractors☆18Jun 2, 2020Updated 5 years ago
- Basic RDF Datatypes☆15Feb 23, 2026Updated last month
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆180Mar 18, 2023Updated 3 years ago
- Text pattern search using marisa-trie☆18Jan 26, 2025Updated last year
- VESPA: Very large-scale Evolutionary and Selective Pressure Analyses☆15Mar 18, 2022Updated 4 years ago
- A set of "real-time" covid19 county-level dashboards w/ national and state choropleths for monitoring localized infection resurgences as …☆10Apr 12, 2023Updated 2 years ago
- A graph annotation tool using a flask server and javascript☆12Mar 25, 2023Updated 3 years ago