flexpaper / pdf2jsonLinks
PDF2JSON is a conversion library based on XPDF (3.02) which can be used for high performance PDF page by page conversion to JSON and XML format. It also supports compressing data to minimize size. PDF2JSON is available for Windows, OSX and Linux. Please see https://flowpaper.com for more information
☆311Updated 5 years ago
Alternatives and similar repositories for pdf2json
Users that are interested in pdf2json are comparing it to the libraries listed below
Sorting:
- it will contain different utilities for GMail API over OAuth2☆416Updated 2 years ago
- A versioning data store for time-variant graph data.☆341Updated 11 months ago
- pdftilecut lets you sub-divide a PDF page(s) into smaller pages so you can print them on small form printers.☆352Updated 10 months ago
- JSON processing utility☆505Updated 2 years ago
- Extract and decompose (fuzzy) URLs (including emails, which are conceptually a part of URLs) in texts with Area-Pattern-based modularity☆354Updated 5 months ago
- ☆393Updated 9 months ago
- Web clipper browser extension for saving highlights, screenshots, and automatically extracting content from web pages.☆374Updated 3 years ago
- Query CSVs using SQL☆167Updated 5 years ago
- ☆189Updated 4 years ago
- An interactive demo walk-through we built to give visitors a feel for what the Trevor.io platform does☆253Updated 5 years ago
- Excel-like Experience for Web Apps (The performant & reliable Vanilla Javascript data grid with Excel-like controls)☆523Updated 3 weeks ago
- Source code of my personal blog☆345Updated 5 months ago
- DropBox/GoogleDrive-style 2-way sync using rsync and fswatch☆143Updated 5 years ago
- Well-Tempered Traveler☆126Updated 11 months ago
- ☆179Updated 5 years ago
- Geocode rows in a SQLite database table☆237Updated 2 years ago
- Next-Generation Interactive Notebooks☆308Updated 2 years ago
- A Global Exhaustive First and Last Name Database☆735Updated 2 years ago
- DataHen Till is a companion tool to your existing web scraper that instantly makes it scalable, maintainable, and more unblockable, with …☆815Updated 3 years ago
- Dirty Little SQL Notebook☆114Updated 2 years ago
- Meetups and online groups for Hacker News readers☆122Updated 6 years ago
- Simple SQL-like syntax on top of Perl text processing.☆411Updated 6 years ago
- Human Response Code: Designed to be recognized by humans and OCR. Encodes all valid URL characters to images.☆228Updated 5 years ago
- a simple syntax for complex argumentation☆958Updated last week
- A CLI tool for planning trip itinerary.☆362Updated last year
- Screening emails workflow☆101Updated 8 months ago
- A Python library to inspect and modify the internal structure of a PDF file☆995Updated last week
- Textricator is a tool to extract text from documents and generate structured data.☆346Updated 4 months ago
- Add-in for Excel that finds formula errors☆98Updated 7 months ago
- Tutorial on paged.js☆326Updated 3 years ago