PDBF - A Toolkit for Creating Janiform Data Documents
☆50Jul 31, 2016Updated 9 years ago
Alternatives and similar repositories for PDBF
Users who are interested in PDBF are comparing it to the libraries listed below.
- Allow URLs to point to any text piece in a document☆16Sep 18, 2017Updated 8 years ago
- OCRopus model for Gothic print (Fraktur)☆19Feb 16, 2020Updated 6 years ago
- Universalizing Open-Access Journals & Papers☆19Mar 8, 2017Updated 9 years ago
- MonetDB RESTful Proxy☆14Mar 28, 2019Updated 6 years ago
- Scripts for scraping metadata from Academia.edu and migrating publications into Zenodo.org via its REST API☆12Jan 25, 2017Updated 9 years ago
- Converters for various file formats used for representing OCR☆12Apr 30, 2025Updated 10 months ago
- Crop And Splice Segments (of scanned pages)☆14Mar 11, 2019Updated 7 years ago
- Server-side Zotero translation based on Mozilla xpcshell (deprecated)☆37Aug 31, 2018Updated 7 years ago
- An API spec to define how to find text in a Web document, using basic information, and return DOM ranges☆15Mar 5, 2019Updated 7 years ago
- stub repo for prelim work on SSRN replacement☆15May 19, 2016Updated 9 years ago
- Core libraries by the PRImA Research Lab☆16Jul 30, 2024Updated last year
- A Markdown pre-processor with support for BibTeX citations.☆12May 22, 2018Updated 7 years ago
- Zotero extension for cataloguing☆49Feb 22, 2024Updated 2 years ago
- An OJS 3 plugin to generate an article citation in any CSL citation style using citeproc-php.☆16Mar 13, 2026Updated last week
- Named Entity Recognition tool for Europeana Newspapers☆14Apr 5, 2018Updated 7 years ago
- JournalTouch provides a touch-optimized interface for browsing current journal tables of contents in Responsive Design. Fun!☆14May 27, 2019Updated 6 years ago
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Jul 2, 2021Updated 4 years ago
- A python library to deal with scientific papers.☆17Apr 2, 2016Updated 9 years ago
- Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)☆17Sep 18, 2025Updated 6 months ago
- API wrapper enabling Wikisources to submit images for optical character recognition.☆16Mar 19, 2026Updated last week
- Web service for creating and hosting IIIF manifests from METS/MODS documents☆36Dec 8, 2022Updated 3 years ago
- Japanese trained data of clstm☆15Jun 6, 2016Updated 9 years ago
- Web application for transcribing OCR ground truth from Archive.org☆17Feb 22, 2018Updated 8 years ago
- ☆15Nov 3, 2018Updated 7 years ago
- A repository for online OCRD training infrastructure.☆13Aug 20, 2020Updated 5 years ago
- A small RDF library with minimal dependencies for embedded devices☆12May 6, 2019Updated 6 years ago
- Ergonomic line-by-line transcription of scanned text.☆54Feb 2, 2026Updated last month
- ☆11Oct 20, 2017Updated 8 years ago
- Command-line client for the DataCite Metadata Store (MDS)☆18Mar 9, 2021Updated 5 years ago
- JS for overlaying OCR on image using HOCR formatted HTML☆26Jul 30, 2016Updated 9 years ago
- A general purpose processing framework for corpora of scientific documents☆65Mar 11, 2026Updated 2 weeks ago
- Sentry wrapper for shell script invocations☆12Oct 24, 2025Updated 5 months ago
- Text-Induced Corpus Clean-up☆20Jun 20, 2023Updated 2 years ago
- Adding links to full text in Wikipedia references☆37Jun 16, 2025Updated 9 months ago
- A LevelDB backed URL unshortening microservice written in JavaScript☆31Dec 10, 2022Updated 3 years ago
- SPARQL-LD: A SPARQL Extension for Fetching and Querying Linked Data☆17Jul 3, 2023Updated 2 years ago
- Web privacy analysis of Sweden's 290 municipalities.☆11Nov 18, 2022Updated 3 years ago
- Visualization of confirmed Covid-19 cases☆26May 15, 2020Updated 5 years ago
- A simple Python Flask-based implementation of the IIIF Image API 2.0 standard☆12Feb 4, 2022Updated 4 years ago