pdf2xml convertor based on Xpdf library - modified version
☆27Feb 23, 2018Updated 8 years ago
Alternatives and similar repositories for pdf2xml
Users that are interested in pdf2xml are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Knowledge Base for research software relying on large-scale text mining and curated knowledge sources☆17May 14, 2023Updated 2 years ago
- Service for converting and enhancing heterogeneous publisher XML formats into TEI☆63Sep 14, 2024Updated last year
- Some examples of usage of Grobid in a third party java project.☆20Jun 14, 2023Updated 2 years ago
- A machine learning software for extracting astronomical entities from scholarly documents☆10Oct 31, 2022Updated 3 years ago
- Code for pre-training CharacterBERT models (as well as BERT models).☆34Sep 6, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The Python crash course of the Summer Institute in Computational Social Science 2022!☆10Nov 19, 2022Updated 3 years ago
- Source and scripts for generating DCMI Metadata Terms documentation☆33Feb 3, 2018Updated 8 years ago
- 🕸 GlotWeb: Web Indexing for Minority Languages (WWW 2026)☆17Feb 27, 2026Updated last month
- A small python library to parse and write TSV files generated by the WebAnno software.☆11Apr 14, 2025Updated 11 months ago
- An ontology containing biotic and abiotic plant stresses. Part of the Planteome suite of reference ontologies. Formerly called the Onto…☆18Updated this week
- DatasetImgLabeler is a image annotation tool for researchers to prepare datasets in ICDAR2015 format☆12Dec 7, 2019Updated 6 years ago
- OCR for DjVu☆47Oct 3, 2022Updated 3 years ago
- Automagically ignore all notifications related to work when you are on vacations☆21Aug 21, 2020Updated 5 years ago
- Web-based page layout editor created for EMOP (Early Modern OCR Project).☆11May 21, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- GROBID extension for identifying and normalizing physical quantities.☆83Updated this week
- A jQuery plugin that adds spellcheck support to inputs using the Google spell checker API.☆91Feb 13, 2010Updated 16 years ago
- A Lucene Indexer for XML, with lexical analysis (lemmatization for French)☆18Updated this week
- Poor man's simple harvester for arXiv resources☆13Jul 14, 2023Updated 2 years ago
- ACL style for Typst☆22Jan 27, 2026Updated 2 months ago
- ☆10Aug 5, 2019Updated 6 years ago
- An open source library for the ETPKLDiv generation algorithm invented by Lucas and Volz.☆10Dec 29, 2019Updated 6 years ago
- Java command line tool to convert PAGE XML files with layout and text content to PDF☆10Apr 27, 2020Updated 5 years ago
- ☆10Apr 21, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Core libraries by the PRImA Research Lab☆16Jul 30, 2024Updated last year
- Augment line images for improving OCR datasets☆10Oct 4, 2023Updated 2 years ago
- FDDWNET: A LIGHTWEIGHT CONVOLUTIONAL NEURAL NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION(ICASSP2020)☆10May 7, 2020Updated 5 years ago
- Adaptation of ring-oauth2 to reitit routes + example of usage☆10Jul 4, 2021Updated 4 years ago
- A desktop wrapper for Mirador and its environment, allowing use of local images.☆14Aug 24, 2018Updated 7 years ago
- fixed some errors from AirBernard/Scene-Text-Detection-with-SPCNET☆13Jul 29, 2019Updated 6 years ago
- ☆20Sep 15, 2022Updated 3 years ago
- Pytorch official implementation for Imitating Unknown Policies via Exploration.☆14Oct 3, 2023Updated 2 years ago
- Finding mentions and citations to named and implicit research datasets from within the academic literature☆30Jun 14, 2025Updated 9 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- A little demo how to bind an advanced data science algorithms to 4 different languages☆10Nov 6, 2018Updated 7 years ago
- Fully Point-wise Convolutional Neural Network☆11Dec 30, 2019Updated 6 years ago
- A Named-Entity Recogniser based on Grobid.☆54May 14, 2025Updated 10 months ago
- Safely wrap all selected text contained within a DOM Range☆14Jan 31, 2015Updated 11 years ago
- Image De-Hazing by finding Transmittance and Airlight☆10Apr 15, 2019Updated 6 years ago
- Tensorflow for Orange PI Zero☆15Apr 3, 2018Updated 8 years ago
- ☆12Oct 8, 2020Updated 5 years ago