PDF2JSON is a conversion library based on XPDF (3.02) which can be used for high performance PDF page by page conversion to JSON and XML format. It also supports compressing data to minimize size. PDF2JSON is available for Windows, OSX and Linux. Please see https://flowpaper.com for more information
☆320Jun 21, 2020Updated 5 years ago
Alternatives and similar repositories for pdf2json
Users that are interested in pdf2json are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OOXML to HTML conversion☆16Jan 3, 2023Updated 3 years ago
- Burst fetch requests through DO scaling☆12Jan 16, 2025Updated last year
- Benchmark comparing the nats message queue with a REST api server.☆12May 4, 2016Updated 9 years ago
- A presentation (in Markdown) for the IETF Hub Boston on June 12, 2018.☆11Sep 20, 2019Updated 6 years ago
- Incorporates external dependencies into HTML file using data: URI scheme☆21Nov 17, 2011Updated 14 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Interact with SQL databases in Go☆14Mar 3, 2026Updated last month
- OKCJUG presentation stuff☆12Jul 11, 2018Updated 7 years ago
- Server-sent events rewritten on top of fetch☆19Oct 30, 2018Updated 7 years ago
- Build styled component with css-modules☆15Mar 21, 2017Updated 9 years ago
- materials for my workshop "Latest Deep Learning Models for NLP" @ the European Open Data Science Conference 2019☆11Feb 3, 2020Updated 6 years ago
- Heroku buildpack to install Caddy, the fast, cross-platform HTTP/2 web server with automatic HTTPS☆11Aug 19, 2017Updated 8 years ago
- Build web applications with Go. #golang #go☆20Sep 27, 2021Updated 4 years ago
- Transactional, replicable document store for Node.js and browsers. Built on LevelDB.☆10Aug 18, 2015Updated 10 years ago
- Simple ToDo-App with Electron, AngularJs and Material Design☆16Sep 20, 2016Updated 9 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Bridging Large Language Models with Scala 3 Functions☆11Aug 31, 2024Updated last year
- ☆15Mar 21, 2018Updated 8 years ago
- SQL on CSV files in the shell☆15Apr 6, 2018Updated 8 years ago
- Provides a mockable wrapper around the reqwest HTTP client for Rust.☆16May 18, 2023Updated 2 years ago
- Wellcome tool to parse references scraped from policy documents using machine learning☆25May 10, 2021Updated 4 years ago
- Turns your Realtek RTL2832 based DVB dongle into a DAB radio receiver☆11Oct 25, 2015Updated 10 years ago
- Read-only mirror of https://framagit.org/tuxor1337/springerdownload. Pull requests and issues on GitHub cannot be accepted and will be au…☆41Feb 12, 2023Updated 3 years ago
- A set of tools to allow PDF to XML conversion, utilising Apache Beam and other tools. The aim of this project is to bring multiple tools…☆297Apr 1, 2026Updated last week
- Open-source keyboard firmware for Atmel AVR and Arm USB families☆15Sep 21, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Run a program in a modified environment providing an optional .env file or variables from the standard input.☆11Jan 4, 2026Updated 3 months ago
- Deduplicated indexed binary storage for JSON☆162Dec 15, 2025Updated 3 months ago
- A CRDT-based collaborative editor engine of letters.yandex.ru (2012, historical)☆69Dec 8, 2021Updated 4 years ago
- A simple Parser for Roc☆30Jan 28, 2025Updated last year
- Direct editing support for diagram-js☆19Feb 7, 2026Updated 2 months ago
- Utilities for converting between Prosemirror schemas and the Pandoc JSON format☆18Mar 4, 2023Updated 3 years ago
- Read-only mirror of https://framagit.org/tuxor1337/firedict. Pull requests and issues on GitHub cannot be accepted and will be automati…☆18Feb 12, 2023Updated 3 years ago
- Scala interfaces to huggingface transformers and tokenizers☆13Mar 31, 2026Updated last week
- OCRopus model for Gothic print (Fraktur)☆19Feb 16, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Neural network inference the Unix way☆561Apr 21, 2019Updated 6 years ago
- A fast CSV command line toolkit written in Rust.☆10,761Apr 24, 2025Updated 11 months ago
- ☆12Oct 21, 2018Updated 7 years ago
- Memorable references to binary data (eg. private keys) encoded as common words.☆12Oct 16, 2022Updated 3 years ago
- 4Catalyzer JavaScript Tooling☆18Apr 3, 2026Updated last week
- A simple tool for visually comparing two PDF files☆4,208Mar 28, 2026Updated last week
- An open source multi-tool for exploring and publishing data☆10,931Mar 31, 2026Updated last week