flexpaper / pdf2jsonLinks
PDF2JSON is a conversion library based on XPDF (3.02) which can be used for high performance PDF page by page conversion to JSON and XML format. It also supports compressing data to minimize size. PDF2JSON is available for Windows, OSX and Linux. Please see https://flowpaper.com for more information
☆315Updated 5 years ago
Alternatives and similar repositories for pdf2json
Users that are interested in pdf2json are comparing it to the libraries listed below
Sorting:
- ☆391Updated last year
- pdftilecut lets you sub-divide a PDF page(s) into smaller pages so you can print them on small form printers.☆360Updated last year
- Web clipper browser extension for saving highlights, screenshots, and automatically extracting content from web pages.☆374Updated 4 years ago
- it will contain different utilities for GMail API over OAuth2☆415Updated 2 years ago
- A versioning data store for time-variant graph data.☆343Updated last year
- ☆178Updated 5 years ago
- Extract and decompose (fuzzy) URLs (including emails, which are conceptually a part of URLs) in texts with Area-Pattern-based modularity☆356Updated 10 months ago
- Convert a number to an approximated text expression: from '0.23' to 'less than a quarter'.☆200Updated 4 years ago
- Query CSVs using SQL☆167Updated 6 years ago
- API for extracting a table from an image or a PDF☆91Updated last year
- ☆194Updated 4 years ago
- Simple JSON based geolocation API, powered by Google App Engine.☆106Updated 12 years ago
- JSON processing utility☆508Updated 3 years ago
- Dirty Little SQL Notebook☆115Updated 3 years ago
- A browser add-on to strip search results from 'blacklisted' URLs on Google☆290Updated 4 years ago
- Geocode rows in a SQLite database table☆237Updated 3 years ago
- Tutorial on paged.js☆325Updated 4 years ago
- DOM Recorder☆189Updated 4 years ago
- DropBox/GoogleDrive-style 2-way sync using rsync and fswatch☆143Updated 5 years ago
- An algorithm for generating robust XPath locators for web testing.☆185Updated 2 years ago
- using XPDF, pdftojson extracts text from PDF files as JSON, including word bounding boxes.☆147Updated 2 years ago
- An interactive demo walk-through we built to give visitors a feel for what the Trevor.io platform does☆251Updated 5 years ago
- Interactive visualization library for concept map☆92Updated 6 years ago
- Qbix Platform for powering Social Apps (http://qbix.com/platform)☆93Updated last year
- A Global Exhaustive First and Last Name Database☆740Updated 2 years ago
- OpenTeams is an opensource team visualization tool.☆84Updated 5 years ago
- Well-Tempered Traveler☆126Updated last year
- PDF viewer created using Electron framework and PDF.js☆102Updated 2 years ago
- Bit-sync is a utility for synchronizing arbitrary data using the rsync algorithm in pure js☆287Updated 11 years ago
- WarcDB: Web crawl data as SQLite databases.☆404Updated last year