flexpaper / pdf2jsonLinks
PDF2JSON is a conversion library based on XPDF (3.02) which can be used for high performance PDF page by page conversion to JSON and XML format. It also supports compressing data to minimize size. PDF2JSON is available for Windows, OSX and Linux. Please see https://flowpaper.com for more information
☆317Updated 5 years ago
Alternatives and similar repositories for pdf2json
Users that are interested in pdf2json are comparing it to the libraries listed below
Sorting:
- Web clipper browser extension for saving highlights, screenshots, and automatically extracting content from web pages.☆375Updated 4 years ago
- ☆392Updated last year
- it will contain different utilities for GMail API over OAuth2☆416Updated 2 years ago
- A versioning data store for time-variant graph data.☆344Updated last year
- Query CSVs using SQL☆167Updated 6 years ago
- ☆195Updated 4 years ago
- Dirty Little SQL Notebook☆115Updated 2 weeks ago
- JSON processing utility☆508Updated 3 years ago
- API for extracting a table from an image or a PDF☆90Updated last year
- Extract and decompose (fuzzy) URLs (including emails, which are conceptually a part of URLs) in texts with Area-Pattern-based modularity☆358Updated last year
- Excel-like Experience for Web Apps (The performant & reliable Vanilla Javascript data grid with Excel-like controls)☆534Updated 7 months ago
- JavaScript port of TLSH (Trend Micro Locality Sensitive Hash)☆162Updated 4 years ago
- An algorithm for generating robust XPath locators for web testing.☆186Updated 3 years ago
- ☆177Updated 5 years ago
- WarcDB: Web crawl data as SQLite databases.☆405Updated last year
- Geocode rows in a SQLite database table☆237Updated 3 years ago
- An interactive demo walk-through we built to give visitors a feel for what the Trevor.io platform does☆251Updated 5 years ago
- Interactive visualization library for concept map☆93Updated 6 years ago
- Source code of my personal blog☆350Updated last year
- Simple SQL-like syntax on top of Perl text processing.☆413Updated 6 years ago
- pdftilecut lets you sub-divide a PDF page(s) into smaller pages so you can print them on small form printers.☆361Updated last year
- Create beautiful ascii trees☆172Updated 6 years ago
- DOM Recorder☆191Updated 4 years ago
- Scraping assistant tool. Editing and maintaining CSS/XPath selectors across webpages.☆125Updated 7 years ago
- A brief POC of what a Memex could potentially be.☆176Updated 5 years ago
- Screening emails workflow☆101Updated last year
- Convert a number to an approximated text expression: from '0.23' to 'less than a quarter'.☆200Updated 5 years ago
- Hackity hack☆103Updated 6 years ago
- Next-Generation Interactive Notebooks☆307Updated 3 years ago
- DropBox/GoogleDrive-style 2-way sync using rsync and fswatch☆143Updated 6 years ago