pdf2xml convertor based on Xpdf library - modified version
☆27Feb 23, 2018Updated 8 years ago
Alternatives and similar repositories for pdf2xml
Users that are interested in pdf2xml are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PDF to XML ALTO file converter☆269May 10, 2026Updated last week
- ☆18Apr 6, 2021Updated 5 years ago
- OpenQuant通视股票全推行情接口,已经不再维护,请移步XAPI2项目☆14Sep 12, 2013Updated 12 years ago
- Some examples of usage of Grobid in a third party java project.☆20Jun 14, 2023Updated 2 years ago
- Logiciel utilise sur la plateforme HAL☆12Jul 13, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Generating graph structures from OWL ontologies☆12Nov 21, 2017Updated 8 years ago
- A machine learning software for extracting astronomical entities from scholarly documents☆10Oct 31, 2022Updated 3 years ago
- Pre-processing text and tokenization for UTH-BERT☆10Sep 30, 2020Updated 5 years ago
- ☆11Apr 15, 2022Updated 4 years ago
- Tools for creating a MongoDB collection of ChemDataExtractor-snowball records☆14May 29, 2019Updated 6 years ago
- Collection of LaTeX utility packages for scientific documents☆17Sep 13, 2023Updated 2 years ago
- ChemDataExtractor toolkit updated to include semi-supervised quaternary relationship extraction☆13Feb 8, 2021Updated 5 years ago
- The Python crash course of the Summer Institute in Computational Social Science 2022!☆10Nov 19, 2022Updated 3 years ago
- Calculate MFCC/Fbank feature for wav files☆15Nov 21, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆21May 1, 2025Updated last year
- ☆10May 29, 2020Updated 5 years ago
- Python client for GROBID Web services☆405Mar 5, 2026Updated 2 months ago
- A small python library to parse and write TSV files generated by the WebAnno software.☆11Apr 14, 2025Updated last year
- An ontology containing biotic and abiotic plant stresses. Part of the Planteome suite of reference ontologies. Formerly called the Onto…☆18Apr 14, 2026Updated last month
- Line shuffler for huge text file which does not fit in memory☆13Dec 1, 2022Updated 3 years ago
- OCR for DjVu☆47Oct 3, 2022Updated 3 years ago
- Automagically ignore all notifications related to work when you are on vacations☆21Aug 21, 2020Updated 5 years ago
- This repository hosts the dataset for the paper Computer Science Named Entity Recognition in the Open Research Knowledge Graph☆22Jan 8, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Web-based page layout editor created for EMOP (Early Modern OCR Project).☆11May 21, 2021Updated 5 years ago
- GROBID extension for identifying and normalizing physical quantities.☆84Apr 8, 2026Updated last month
- Repository for the learning materials of the Aachen-Graz SICSS location.☆19Oct 19, 2023Updated 2 years ago
- A Lucene Indexer for XML, with lexical analysis (lemmatization for French)☆18May 15, 2026Updated last week
- Poor man's simple harvester for arXiv resources☆14Jul 14, 2023Updated 2 years ago
- GC4LM: A Colossal (Biased) language model for German☆13May 2, 2021Updated 5 years ago
- Collection of tools to extract semantic information from (mathematical) research articles☆23Feb 7, 2026Updated 3 months ago
- ACL style for Typst☆23Jan 27, 2026Updated 3 months ago
- ☆10Aug 5, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Audible Electromagnetic Interference Detector - Model ET-1☆14Mar 7, 2020Updated 6 years ago
- Async procedures for Clojure☆13Oct 5, 2022Updated 3 years ago
- Java command line tool to convert PAGE XML files with layout and text content to PDF☆10Apr 27, 2020Updated 6 years ago
- ☆10Apr 21, 2020Updated 6 years ago
- product recommendation text generation using OpenCCG☆28Dec 5, 2021Updated 4 years ago
- ggplot2 extension to add a table to an axis.☆13May 29, 2021Updated 4 years ago
- A browser extension providing Open Access bibliographical services☆18Dec 9, 2022Updated 3 years ago