ad-freiburg/pdfact

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ad-freiburg/pdfact)

ad-freiburg / pdfact

A basic tool that extracts the structure from the PDF files of scientific articles.

☆77

Alternatives and similar repositories for pdfact

Users that are interested in pdfact are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ckorzen / icecite
View on GitHub
The repository of Icecite, a research paper management system.
☆15Mar 29, 2018Updated 8 years ago
qurator-spk / sbb_ned
View on GitHub
Named Entity Disambiguation and Linking
☆16May 24, 2024Updated 2 years ago
data-liberation / table-understanding-dataset
View on GitHub
table understanding dataset for comparative evaluation of different table understanding algorithms
☆13Jun 15, 2018Updated 8 years ago
tamirhassan / pdfxtk
View on GitHub
PDF Extraction Toolkit
☆43Nov 23, 2020Updated 5 years ago
justincbagley / piranha
View on GitHub
Scripts for file processing and analysis in phylogenetics and phylogeography
☆14Jan 6, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
qlever-dev / qlever-control
View on GitHub
The qlever command-line tool. With this you can control (almost) everything QLever can do
☆73Updated this week
trec-dd / trec-dd-jig
View on GitHub
Simulated user for TREC 2016-2017 Dynamic Domain track
☆10Dec 27, 2017Updated 8 years ago
jingtaozhan / bert-ranking-analysis
View on GitHub
SIGIR'20: An Analysis of BERT in Document Ranking
☆21Jul 27, 2020Updated 5 years ago
ielab / searchrefiner
View on GitHub
Systematic Review Query Visualisation and Understanding Interface
☆17Dec 5, 2025Updated 7 months ago
BMKEG / lapdftext
View on GitHub
LA-PDFText is a system for extracting accurate text from PDF-based research articles (and an interface to be able to improve performance …
☆82Mar 2, 2018Updated 8 years ago
JonathanRaiman / ciseau
View on GitHub
Tokenize and clean strings in Python
☆11Jan 11, 2018Updated 8 years ago
jruipinto / ImageMagick-action
View on GitHub
A GitHub action to auto optimize uploaded images using ImageMagick
☆10Apr 28, 2024Updated 2 years ago
terrierteam / pyterrier_pisa
View on GitHub
A Python interface to PISA
☆37Jun 4, 2026Updated last month
ielab / asyncval
View on GitHub
A toolkit for asynchronously validating dense retriever checkpoints during training.
☆27Aug 10, 2023Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
lcnetdev / bibframe2marc
View on GitHub
XSLT application to generate MARCXML from BIBFRAME RDF/XML
☆19Jul 8, 2026Updated last week
brainstorm / s3-rust-htslib-bam
View on GitHub
AWS lambda S3 + rust-htslib: A serverless bioinformatics example
☆14Jun 7, 2022Updated 4 years ago
reeset / marcedit_xslt_files
View on GitHub
Shared XSLT Files
☆30May 11, 2021Updated 5 years ago
schmmd / ollie
View on GitHub
Ollie is a open information extractor that uses dependency parses.
☆12Sep 27, 2013Updated 12 years ago
adrianeboyd / BrillMooreSpellChecker
View on GitHub
Spell checker using Brill and Moore's noisy channel error model
☆13Jan 9, 2019Updated 7 years ago
boston-library / blacklight_iiif_search
View on GitHub
Blacklight IIIF Content Search plugin
☆14Mar 17, 2026Updated 4 months ago
ParkerICI / datalogr
View on GitHub
An R package to write Datalog queries and interact with a Datomic database
☆11Aug 12, 2021Updated 4 years ago
SeerLabs / pdfmef
View on GitHub
Multi-Entity Extraction Framework for Academic Documents (with default extraction tools)
☆31Oct 3, 2023Updated 2 years ago
thegetty / Ogee
View on GitHub
Ogee Arches is a package designed for the Arches platform that implements the Linked.art data model, provides a complete vocabulary to su…
☆16Feb 4, 2026Updated 5 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Vanille-N / billig
View on GitHub
A command-line DSL budget manager
☆13Oct 25, 2022Updated 3 years ago
TSO-Openup / FlintSparqlEditor
View on GitHub
Flint SPARQL editor
☆51Oct 16, 2012Updated 13 years ago
Imamachi-n / BioRxivCurator
View on GitHub
Batch scripts curating BioRxiv and PubMed articles by using Altmetric score.
☆11May 9, 2020Updated 6 years ago
cftang0827 / face_alignment
View on GitHub
Simple face alignment library by using face_recognition and opencv
☆16Mar 13, 2019Updated 7 years ago
webis-de / mturk-manager
View on GitHub
An alternative front end for Amazon Mechanical Turk
☆12May 13, 2024Updated 2 years ago
amartinsec / MS-URI-Handlers
View on GitHub
☆25Oct 19, 2023Updated 2 years ago
allenai / science-parse
View on GitHub
Science Parse parses scientific papers (in PDF form) and returns them in structured form.
☆699May 26, 2024Updated 2 years ago
phaidra / phaidra
View on GitHub
Mirror of the official development repository of PHAIDRA. We monitor our public github repo, so contributions via issues & pull requests…
☆22Jul 7, 2026Updated 2 weeks ago
Layout-Parser / annotation-service
View on GitHub
☆20Jul 22, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
has-abi / docparser
View on GitHub
Extract text from your DOCX documents.
☆11Feb 10, 2024Updated 2 years ago
justahuman1 / flask_dashboard
View on GitHub
A dashboard for integrating Python, Tableau, and Google Sheets for automated data collection, analysis, and visualization.
☆10Dec 8, 2022Updated 3 years ago
edgardomortiz / paralog-finder
View on GitHub
Detects and blacklists paralog RAD loci analyzed in Stacks or ipyrad, based on the McKinney 2017 method (doi:10.1111/1755-0998.12613)
☆10Sep 4, 2019Updated 6 years ago
avalanchesiqi / networked-popularity
View on GitHub
Code and Data for paper: Estimating Attention Flow in Online Video Networks (CSCW '19)
☆12Nov 19, 2019Updated 6 years ago
yinleon / inspect-element
View on GitHub
Inspect Element is a practitioner's guide to auditing algorithms and data-driven investigations
☆39Jul 10, 2025Updated last year
xigt / freki
View on GitHub
Analyze XML extracted from PDFs (e.g. from TET or PDFMiner)
☆20Jan 11, 2018Updated 8 years ago
alexsleat / projectChimaera
View on GitHub
Rinzler is an exceptionally skilled warrior and is the elite combatant in all games in the Grid.
☆11Jul 7, 2012Updated 14 years ago