LA-PDFText is a system for extracting accurate text from PDF-based research articles (and an interface to be able to improve performance where needed). The system is open-source and provides a simple baseline function for extracting text from primary research articles using rules that developers can customize. This means that the system works qu…
☆81Mar 2, 2018Updated 8 years ago
Alternatives and similar repositories for lapdftext
Users that are interested in lapdftext are comparing it to the libraries listed below
Sorting:
- High-level build project for all LAPDF-Text submodules☆103Jul 2, 2015Updated 10 years ago
- LA-PDFText is a system for extracting accurate text from PDF-based research articles (and an interface to be able to improve performance …☆15Mar 21, 2019Updated 7 years ago
- Web-based page layout editor created for EMOP (Early Modern OCR Project).☆11May 21, 2021Updated 4 years ago
- Whole-cell modeling tutorials☆17Jun 26, 2020Updated 5 years ago
- A basic tool that extracts the structure from the PDF files of scientific articles.☆76Jan 4, 2022Updated 4 years ago
- Provides an HTTP server endpoint for interacting with Zotero☆25Nov 10, 2024Updated last year
- Babel creates cliques of equivalent identifiers across many biomedical vocabularies.☆14Updated this week
- my take at a PDF text extraction utility☆25Jun 15, 2015Updated 10 years ago
- The JSON API Browser☆40Dec 5, 2013Updated 12 years ago
- Full text extraction using the Open Source Tesseract OCR software https://code.google.com/p/tesseract-ocr/ and imagemagick☆13May 16, 2015Updated 10 years ago
- An ordered Python dictionary with attribute-style access.☆16Jun 23, 2020Updated 5 years ago
- First pass at a thin wrapper around the Monarch API and ChatGPT plugin☆12Mar 21, 2025Updated last year
- REx: Relation Extraction. Modernized re-write of the code in the master's thesis: "Relation Extraction using Distant Supervision, SVMs, a…☆22Mar 7, 2018Updated 8 years ago
- PHP client library for communicating with GetEventStore.☆12Mar 7, 2016Updated 10 years ago
- Go bindings for the Apache Lucy full text search library. The Apache Lucy search engine library provides full-text search for dynamic pro…☆47Sep 24, 2014Updated 11 years ago
- Word2Vec - Google's word2vec in Scala using UMASS factorie library for better hacking and research.☆16Apr 7, 2014Updated 11 years ago
- small example on how to get SVO (subject, verb, object) information from an input, as well as whether that input was a question.☆17Aug 6, 2019Updated 6 years ago
- wrapper for the crossref events api☆23May 23, 2023Updated 2 years ago
- The seat_saver repo for Elm 0.17☆12Dec 22, 2016Updated 9 years ago
- Various scripts that facilitate the preparation of Automatic Speech Recognition related resources☆17Apr 16, 2020Updated 5 years ago
- MiTextExplorer - interactive browser of text and document covariates.☆24Jun 17, 2015Updated 10 years ago
- Network Embedding All the Things☆18Oct 4, 2022Updated 3 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 3 months ago
- A tool for analyzing and visualizing discrete temporal events☆17Aug 15, 2018Updated 7 years ago
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea☆13Sep 1, 2016Updated 9 years ago
- An R data package for NIH EXPORTER data☆15Mar 8, 2025Updated last year
- An R package for working with phenotypic screening data☆10Nov 22, 2018Updated 7 years ago
- ☆19May 10, 2024Updated last year
- several algorithms for converting dependency structures into constituency structures.☆10Feb 7, 2022Updated 4 years ago
- 🐸 Idiomatic conversion between URIs and compact URIs (CURIEs) in Python☆25Updated this week
- Want to automate you or your patient's treatments in a novel way we haven't thought of? TOP APIs make it trivial to expand or change func…☆11Mar 1, 2019Updated 7 years ago
- Number names in R 2️⃣💬☆13Mar 15, 2024Updated 2 years ago
- A generator for synthetic streams of financial transactions.☆11Sep 20, 2022Updated 3 years ago
- ☆31Mar 7, 2017Updated 9 years ago
- The lexDAO Registry // scripts for legal & ethereal deal security☆12Dec 10, 2022Updated 3 years ago
- ☆11Nov 17, 2015Updated 10 years ago
- texrex web page cleaning & ClaraX random walk crawler☆11Dec 13, 2021Updated 4 years ago
- NP-KG: Knowledge Graph Framework for Natural Product-Drug Interactions☆19Jun 25, 2024Updated last year
- CQRS & EventSourcing library for php >= 5.5☆14Oct 20, 2016Updated 9 years ago