flexpaper / pdf2json
PDF2JSON is a conversion library based on XPDF (3.02) that can be used for high-performance, page-by-page conversion of PDF files to JSON and XML formats. It also supports compressing data to minimize size. PDF2JSON is available for Windows, OSX and Linux. Please see https://flowpaper.com for more information.
☆306 · Updated 4 years ago
Alternatives and similar repositories for pdf2json:
Users who are interested in pdf2json are comparing it to the libraries listed below.
- DANGER, WILL ROBINSON: THIS REPOSITORY IS IN MAINTENANCE MODE! I will not be continuing feature development or fixing bugs in this codeba… ☆348 · Updated 4 years ago
- Extract and decompose (fuzzy) URLs (including emails, which are conceptually a part of URLs) in texts with Area-Pattern-based modularity ☆352 · Updated 3 months ago
- Repository for Pipes ☆272 · Updated 8 months ago
- it will contain different utilities for GMail API over OAuth2 ☆414 · Updated last year
- Next-Generation Interactive Notebooks ☆309 · Updated 2 years ago
- A versioning data store for time-variant graph data. ☆340 · Updated 8 months ago
- Web clipper browser extension for saving highlights, screenshots, and automatically extracting content from web pages. ☆374 · Updated 3 years ago
- A site to instantly search 28M books from OpenLibrary using Typesense Search (an open source alternative to Algolia / ElasticSearch) ⚡ 📚… ☆160 · Updated 2 months ago
- Excel-like Experience for Web Apps (The performant & reliable Vanilla Javascript data grid with Excel-like controls) ☆519 · Updated 5 months ago
- Simple JSON based geolocation API, powered by Google App Engine. ☆106 · Updated 12 years ago
- A node.js library for extracting data from scanned forms. ☆117 · Updated 2 years ago
- ☆391 · Updated 7 months ago
- ☆179 · Updated 4 years ago
- Evaluating the performance and accuracy of ABBYY FineReader's OCR on Senate Financial Disclosure scanned forms ☆131 · Updated 9 years ago
- A browser add-on to strip search results from 'blacklisted' URLs on Google ☆292 · Updated 4 years ago
- simplest possible native GUI for inspecting JSON objects with jq ☆370 · Updated 4 years ago
- Random text generator ☆69 · Updated 6 years ago
- A light-weight password manager with a focus on simplicity and security ☆372 · Updated last year
- DataHen Till is a companion tool to your existing web scraper that instantly makes it scalable, maintainable, and more unblockable, with … ☆814 · Updated 3 years ago
- A Python library to inspect and modify the internal structure of a PDF file ☆987 · Updated this week
- Convert a number to an approximated text expression: from '0.23' to 'less than a quarter'. ☆198 · Updated 4 years ago
- using XPDF, pdftojson extracts text from PDF files as JSON, including word bounding boxes. ☆144 · Updated last year
- A post-processing tool for scanned sheets of paper. ☆1,071 · Updated 9 months ago
- Annotation layer for pdf.js (no longer maintained) ☆553 · Updated 6 years ago
- A brief POC of what a Memex could potentially be. ☆177 · Updated 4 years ago
- Asciiflow in VS Code ☆400 · Updated 3 years ago
- A notepad for software and machine learning ☆231 · Updated 6 years ago
- Query CSVs using SQL ☆167 · Updated 5 years ago
- A web app to create and browse text visualizations for automated customer listening. ☆148 · Updated last year
- Qbix Platform for powering Social Apps (http://qbix.com/platform) ☆93 · Updated 7 months ago