JonathanLink / PDFLayoutTextStripper
Converts a PDF file into a text file while preserving the layout of the original PDF. Useful, for instance, for extracting the content of a table in a PDF file. This is a subclass of the PDFTextStripper class (from the Apache PDFBox library).
☆1,574 Updated 11 months ago
Related projects
Alternatives and complementary repositories for PDFLayoutTextStripper
- Scan, index, and archive all of your paper documents (acquired by Mayan EDMS) ☆2,559 Updated 5 years ago
- Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip. ☆1,273 Updated 3 years ago
- 🤖 A Node queue API for generating PDFs using headless Chrome. Comes with a CLI, S3 storage and webhooks for notifying subscribers about … ☆2,625 Updated 8 months ago
- Minimalist and powerful Web Crawler. ☆880 Updated 3 years ago
- Make a self hosted OpenVPN server in 15 minutes ☆808 Updated 7 years ago
- Problem Solving ☆900 Updated 5 years ago
- Generates a quiz for a Wikipedia page using parts of speech and text chunking. ☆803 Updated 4 years ago
- 👨🏭Set up your Linux server with plain shell scripts ☆1,172 Updated 3 years ago
- Dashboards using YAML or JSON files ☆1,570 Updated 11 months ago
- +2600 developer-related blogs and publications. ☆636 Updated 7 years ago
- The magic of Google Autocomplete while you're typing. Anywhere. ☆1,540 Updated last year
- 📸💻 Turn your source code into beautiful syntax-highlighted images. ☆2,194 Updated last year
- Stand for 12" MacBook, 13" MacBook Air and 13" MacBook Pro ☆485 Updated 4 years ago
- A simple, self-contained, serverless, zero-configuration, json document store. ☆843 Updated 2 years ago
- Send your stdin to google sheets ☆546 Updated 4 years ago
- Themes based on the biggest StartUps (buttons, color palette, components, etc.) ready to use in your own projects. ☆783 Updated 7 years ago
- Chrome extension for full text history search! ☆1,825 Updated 3 months ago
- GUI app for editing, visualizing, and manipulating JSON data ☆1,861 Updated 7 years ago
- The Berkeley Document Summarizer is a learning-based, single-document summarization system that extracts source document content, exploit… ☆742 Updated 5 years ago
- Simple command-line utility to convert CSV files to searchable and sortable HTML table. ☆1,117 Updated 3 years ago
- Time tracking you can host anywhere. Full export support in multiple formats and easily extensible. ☆1,692 Updated last year
- ☆2,194 Updated 3 weeks ago
- Efficient Counter that uses a limited (bounded) amount of memory regardless of data size. ☆936 Updated 2 years ago
- Web page PDF/PNG rendering done right. Self-hosted service for rendering receipts, invoices, or any content. ☆7,033 Updated 10 months ago
- Xaddress - Give 7 billion people an instant physical address ☆1,185 Updated 2 years ago
- 💸 Take control back. Track your everyday spendings. ☆440 Updated 6 years ago
- Staffjoy V1, aka "Suite" - a scheduling application for hundreds of workers ☆848 Updated 6 years ago