jfilter / pdf-scripts
📑 Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs
☆68Updated 11 months ago
Alternatives and similar repositories for pdf-scripts:
Users that are interested in pdf-scripts are comparing it to the libraries listed below
- ReadablePDF streamlines the effort of turning a not so great PDF into a more easily readable PDF (or of course a pretty decent PDF into a…☆33Updated 3 years ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆21Updated last year
- Abbreviations for use with the Abbreviation Filter developed for use with Multilingual Zotero.☆17Updated last year
- Extract networks of entities from journalistic reporting☆48Updated last year
- Make graphs you can play with... Web app in Flask and Bootstrap to fetch Zotero datasets and then create graph visualizations with d3.js☆22Updated 7 years ago
- A helper library full of URL-related heuristics.☆69Updated 3 weeks ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆38Updated 3 years ago
- Jurisdiction ID and abbreviation data files for using with Jurism and other projects.☆36Updated last year
- Fetch all your bookmarked tweets and make them accessible through a webinterface.☆29Updated last year
- Auflistung Freier/Libre-Open-Source-Software, die bereits im öffentlichen Dienst genutzt oder gar selbst betrieben wird. Ergänzungen aus …☆31Updated last year
- A social media open post web archiving tool☆25Updated last month
- Awesome list dedicated to digital and data preservation tools, sources, services and so on.☆25Updated 2 years ago
- Comparing warc files☆17Updated 6 years ago
- API client for Aleph, supports bulk entity and document upload.☆28Updated 6 months ago
- quickly and easily search for and download case law; automatically rename downloaded judgments☆23Updated 9 months ago
- Presentations on Quantified Self and Self-Tracking with Python☆30Updated 2 years ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac…☆53Updated 2 months ago
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page…☆40Updated 7 months ago
- Create Robust Links from within Zotero☆19Updated 2 years ago
- Homebrew formula for the ArchiveBox self-hosted internet archiving solution.☆28Updated 6 months ago
- A German lexicon with words assosiated with love, fear, joy, disgust, surprise, contempt and anger☆11Updated last year
- Named-Entity Recognition extension for OpenRefine☆28Updated 2 years ago
- Fast PDF generation and compression. Deals with millions of pages daily.☆115Updated 8 months ago
- H2O is a web app for creating and reading open educational resources, primarily in the legal field☆38Updated 2 months ago
- An espanso package to assist with medical documentation☆26Updated 3 years ago
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.☆24Updated last year
- 🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en…