flexpaper / pdf2json
PDF2JSON is a conversion library based on XPDF (3.02) that can be used for high-performance, page-by-page conversion of PDF files to JSON and XML. It also supports compressing the output to minimize its size. PDF2JSON is available for Windows, OSX, and Linux. See https://flowpaper.com for more information.
☆314 Updated 5 years ago
Alternatives and similar repositories for pdf2json
Users who are interested in pdf2json are comparing it to the libraries listed below.
- Extract and decompose (fuzzy) URLs (including emails, which are conceptually a part of URLs) in texts with Area-Pattern-based modularity ☆355 Updated 9 months ago
- ☆391 Updated last year
- API for extracting a table from an image or a PDF ☆90 Updated last year
- A versioning data store for time-variant graph data. ☆342 Updated last year
- it will contain different utilities for GMail API over OAuth2 ☆415 Updated 2 years ago
- Web clipper browser extension for saving highlights, screenshots, and automatically extracting content from web pages. ☆374 Updated 3 years ago
- Repository for Pipes ☆277 Updated 2 months ago
- Query CSVs using SQL ☆166 Updated 6 years ago
- Source code of my personal blog ☆348 Updated 9 months ago
- WarcDB: Web crawl data as SQLite databases. ☆406 Updated last year
- Screening emails workflow ☆101 Updated 11 months ago
- Dirty Little SQL Notebook ☆115 Updated 3 years ago
- Next-Generation Interactive Notebooks ☆307 Updated 2 years ago
- Convert a number to an approximated text expression: from '0.23' to 'less than a quarter'. ☆200 Updated 4 years ago
- Geocode rows in a SQLite database table ☆237 Updated 2 years ago
- DropBox/GoogleDrive-style 2-way sync using rsync and fswatch ☆143 Updated 5 years ago
- using XPDF, pdftojson extracts text from PDF files as JSON, including word bounding boxes. ☆146 Updated last year
- DOM Recorder ☆189 Updated 4 years ago
- Excel-like Experience for Web Apps (The performant & reliable Vanilla Javascript data grid with Excel-like controls) ☆530 Updated 4 months ago
- Scripts and results from our OCR roundup, available on Source ☆150 Updated 6 years ago
- An algorithm for generating robust XPath locators for web testing. ☆185 Updated 2 years ago
- ☆178 Updated 5 years ago
- End-to-end encrypted notes application ☆122 Updated 3 years ago
- Simple SQL-like syntax on top of Perl text processing. ☆411 Updated 6 years ago
- JSON processing utility ☆506 Updated 3 years ago
- A Global Exhaustive First and Last Name Database ☆739 Updated 2 years ago
- Qbix Platform for powering Social Apps (http://qbix.com/platform) ☆93 Updated last year
- Interactive visualization library for concept map ☆92 Updated 6 years ago
- A java / spring boot application to help you sign and check signed pdf documents ☆96 Updated last year
- Tutorial on paged.js ☆325 Updated 4 years ago