jfilter / pdf-scriptsLinks
π Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs
β70Updated last year
Alternatives and similar repositories for pdf-scripts
Users that are interested in pdf-scripts are comparing it to the libraries listed below
Sorting:
- Comparing warc filesβ17Updated 6 years ago
- A social media open post web archiving toolβ27Updated 2 months ago
- ReadablePDF streamlines the effort of turning a not so great PDF into a more easily readable PDF (or of course a pretty decent PDF into aβ¦β33Updated 3 years ago
- My life dashboard - automatically track and visualize your data. Using common tracker APIs to create a minute by minute representation ofβ¦β19Updated 4 years ago
- π A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivityβ96Updated 6 years ago
- backup and parse your browser history databases (chrome, firefox, safari, and other chrome/firefox derivatives)β143Updated 9 months ago
- π¨ High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.β170Updated 3 weeks ago
- Tool to index and serve HTML files. Powered by Datasette.β105Updated 3 years ago
- Export/access your Hypothes.is data: annotations and profile infoβ44Updated last month
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each pageβ¦β40Updated 11 months ago
- A list of things related to software, literature, and other content for π£ Mementoβ99Updated last year
- Parse db/html/json bookmarks file from (Chrome - Firefox - Custom source) and convert it to db/html/json format.β57Updated last week
- List of resources and tools for self-trackingβ13Updated 6 years ago
- GoodLinks Exporterβ10Updated last year
- π§© Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser enβ¦β19Updated last month
- A simple Python wrapper and command-line interface for archive.orgβs "Save Page Now" capturing serviceβ182Updated 10 months ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.β48Updated this week
- Gets your upvoted posts from Hacker News and imports them to raindrop.ioβ26Updated 2 years ago
- π A community-curated list of awesome lawtech software and learning resources for legal technology and design.β27Updated 5 years ago
- an extensible tool to generate hyperlinks from legal citationsβ35Updated 11 months ago
- Generate a list of your GitHub stars by topic - automatically!β83Updated 2 years ago
- Fast PDF generation and compression. Deals with millions of pages daily.β122Updated this week
- Automatically sync Omnivore pages to Raindrop.ioβ22Updated 9 months ago
- Import data from Apple's Screen Time on macOS and iOS to ActivityWatchβ40Updated 2 years ago
- This project let's you fetch your Pocket Casts statistics and put them into Airtable with about 80 lines of code.β20Updated last week
- A post-processing tool for scanned sheets of paper.β82Updated last year
- Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewingβ¦β88Updated last month
- π Command-line tool to organize large directories of media files recursively by date, detecting duplicates.β26Updated 2 years ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user acβ¦β54Updated this week
- Plugin for Joplin which can be used to extract keywords from note and assign them as a note's tagsβ37Updated 4 years ago