jfilter / pdf-scriptsLinks
π Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs
β70Updated last year
Alternatives and similar repositories for pdf-scripts
Users that are interested in pdf-scripts are comparing it to the libraries listed below
Sorting:
- A social media open post web archiving toolβ27Updated last week
- ReadablePDF streamlines the effort of turning a not so great PDF into a more easily readable PDF (or of course a pretty decent PDF into aβ¦β33Updated 4 years ago
- A list of things related to software, literature, and other content for π£ Mementoβ99Updated last year
- π A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivityβ97Updated 6 years ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user acβ¦β54Updated 3 weeks ago
- Create Robust Links from within Zoteroβ20Updated 3 years ago
- π¨ High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.β171Updated 2 weeks ago
- Export your personal Spotify data: playlists, saved tracks/albums/shows, etc. as JSONβ38Updated last month
- A library/CLI tool to parse data out of your Google Takeout (History, Activity, Youtube, Locations, etc...)β110Updated 2 weeks ago
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each pageβ¦β40Updated last year
- β13Updated 4 years ago
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR β¦β66Updated last year
- Fast PDF generation and compression. Deals with millions of pages daily.β124Updated last week
- Tool to index and serve HTML files. Powered by Datasette.β107Updated 3 years ago
- Make graphs you can play with... Web app in Flask and Bootstrap to fetch Zotero datasets and then create graph visualizations with d3.jsβ22Updated 7 years ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.β50Updated this week
- Homebrew formula for the ArchiveBox self-hosted internet archiving solution.β29Updated 11 months ago
- This project let's you fetch your Pocket Casts statistics and put them into Airtable with about 80 lines of code.β20Updated this week
- A collection of curated home built packages for the cross-platform text expander Espansoβ44Updated 2 months ago
- A post-processing tool for scanned sheets of paper.β82Updated last year
- Some tools to help analyze the twitter archiveβ63Updated 3 months ago
- Comparing warc filesβ17Updated 6 years ago
- π§© Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser enβ¦β19Updated 2 months ago
- Import data from Apple's Screen Time on macOS and iOS to ActivityWatchβ41Updated 2 years ago
- scraper for facebook, gab, google and tiktokβ21Updated 3 months ago
- Extract networks of entities from journalistic reportingβ48Updated 2 years ago
- Export your Github activity: events, repositories, stars, etc.β52Updated 2 months ago
- Datasette plugin for uploading CSV files and converting them to database tablesβ27Updated last year
- Handy AppleScripts I useβ32Updated last month
- H2O is a web app for creating and reading open educational resources, primarily in the legal fieldβ43Updated last month