A basic tool that extracts the structure from the PDF files of scientific articles.
☆76Jan 4, 2022Updated 4 years ago
Alternatives and similar repositories for pdfact
Users that are interested in pdfact are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆70Nov 7, 2020Updated 5 years ago
- my take at a PDF text extraction utility☆25Jun 15, 2015Updated 10 years ago
- A Test Collection for Evaluating Retrieval of Studies for Inclusion in Systematic Reviews☆12Sep 22, 2023Updated 2 years ago
- The qlever command-line tool. With this you can control (almost) everything QLever can do☆67Apr 9, 2026Updated last week
- Keyphrase Extraction Prototypes☆15Nov 24, 2016Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Ogee Arches is a package designed for the Arches platform that implements the Linked.art data model, provides a complete vocabulary to su…☆15Feb 4, 2026Updated 2 months ago
- Natural Language to SQL Queries in the OMOP CDM Datasets☆11Jun 12, 2023Updated 2 years ago
- A distributed stream querying engine that provides sub-millisecond stateful query at millions of queries per-second over fast-evolving li…☆10Jul 18, 2018Updated 7 years ago
- Tokenize and clean strings in Python☆11Jan 11, 2018Updated 8 years ago
- XSLT application to generate MARCXML from BIBFRAME RDF/XML☆19Apr 2, 2026Updated 2 weeks ago
- Shared XSLT Files☆30May 11, 2021Updated 4 years ago
- Spell checker using Brill and Moore's noisy channel error model☆12Jan 9, 2019Updated 7 years ago
- Blacklight IIIF Content Search plugin☆13Mar 17, 2026Updated 3 weeks ago
- Multi-Entity Extraction Framework for Academic Documents (with default extraction tools)☆31Oct 3, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Non-official Fuseki Docker image with GeoSPARQL support☆12Mar 26, 2026Updated 3 weeks ago
- Flint SPARQL editor☆51Oct 16, 2012Updated 13 years ago
- A tool for correcting misspellings in textual input using the Noisy Channel Model.☆11Sep 26, 2020Updated 5 years ago
- Graphical analysis of PDF structure.☆13Jan 9, 2017Updated 9 years ago
- ☆16Apr 9, 2026Updated last week
- Simple face alignment library by using face_recognition and opencv☆16Mar 13, 2019Updated 7 years ago
- Science Parse parses scientific papers (in PDF form) and returns them in structured form.☆698May 26, 2024Updated last year
- Convert ALTO XML to plain text + minimal metadata☆17Oct 17, 2024Updated last year
- ☆20Jul 22, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A SPARQL language server☆41Updated this week
- Run IPython, Pattern, NLTK, Pandas, NumPy, SciPy, Numba, Biopython inside Docker☆47Jul 14, 2014Updated 11 years ago
- Recommendation engine for scholarly articles☆12Oct 22, 2019Updated 6 years ago
- Code and Data for paper: Estimating Attention Flow in Online Video Networks (CSCW '19)☆12Nov 19, 2019Updated 6 years ago
- Jurisdiction ID and abbreviation data files for using with Jurism and other projects.☆44Nov 8, 2023Updated 2 years ago
- Mirror of the official development repository of PHAIDRA. We monitor our public github repo, so contributions via issues & pull requests…☆22Updated this week
- Analyze XML extracted from PDFs (e.g. from TET or PDFMiner)☆20Jan 11, 2018Updated 8 years ago
- liberate all kinds of data from PDF and other unstructural format and make the information machine-readable and visualizeable for popul…☆31Jun 1, 2018Updated 7 years ago
- Basic RDF Datatypes☆15Feb 23, 2026Updated last month
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆180Mar 18, 2023Updated 3 years ago
- Text pattern search using marisa-trie☆19Jan 26, 2025Updated last year
- VESPA: Very large-scale Evolutionary and Selective Pressure Analyses☆15Mar 18, 2022Updated 4 years ago
- Models assisting Red Cross Home Fire Preparedness team target areas for smoke alarm installs. Predictions and indicators used in smoke_a…☆10Jul 7, 2016Updated 9 years ago
- A set of "real-time" covid19 county-level dashboards w/ national and state choropleths for monitoring localized infection resurgences as …☆10Apr 12, 2023Updated 3 years ago
- Template repository for creating a UU styled Quarto presentation☆13Oct 29, 2025Updated 5 months ago
- hnsw implemented by python☆22Nov 28, 2019Updated 6 years ago