jfilter / pdf-scripts
📑 Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs
☆70Updated last year
Alternatives and similar repositories for pdf-scripts
Users that are interested in pdf-scripts are comparing it to the libraries listed below
- A list of things related to software, literature, and other content for 🕣 Memento☆104Updated 3 weeks ago
- ReadablePDF streamlines the effort of turning a not-so-great PDF into a more easily readable PDF (or, of course, a pretty decent PDF into a…☆33Updated 4 years ago
- List of free/libre open-source software that is already used, or even self-hosted, in the public sector. Additions from …☆32Updated 2 years ago
- DocumentCloud's back end source code - Please report bugs, issues and feature requests to info@documentcloud.org☆44Updated last week
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆55Updated this week
- A social media open post web archiving tool☆26Updated 2 months ago
- Comparing warc files☆17Updated 6 years ago
- 🍨 High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.☆187Updated 5 months ago
- Fast PDF generation and compression. Deals with millions of pages daily.☆134Updated last month
- JavaScript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a one-shot CLI command to extract each page…☆42Updated last year
- Tool to index and serve HTML files. Powered by Datasette.☆111Updated 3 years ago
- Export your personal Spotify data: playlists, saved tracks/albums/shows, etc. as JSON☆41Updated 6 months ago
- Small utility to prepare scanned documents. Supports separating PDF files by separator pages and removing blank pages.☆32Updated last year
- Create Robust Links from within Zotero☆21Updated 3 years ago
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service☆188Updated 2 weeks ago
- Homebrew formula for the ArchiveBox self-hosted internet archiving solution.☆28Updated last year
- Export/access your Hypothes.is data: annotations and profile info☆46Updated 6 months ago
- DIY Atom feeds in times of social media and paywalls☆85Updated last year
- ETL pipeline, graphical explorer and general toolbox for investigations with follow the money data☆25Updated 6 months ago
- 🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en…☆19Updated 6 months ago
- [Moved to https://git.huey.xyz/lacuna-technologies/clerkent] quickly and easily search for and download case law; automatically rename do…☆26Updated last month
- This project lets you fetch your Pocket Casts statistics and put them into Airtable with about 80 lines of code.☆20Updated this week
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac…☆58Updated 5 months ago
- ⚙️ The backend for OffeneGesetze.de☆25Updated 2 years ago
- Jurisdiction ID and abbreviation data files for using with Jurism and other projects.☆40Updated 2 years ago
- Export your GitHub activity: events, repositories, stars, etc.☆56Updated this week
- Make your PDF files text-searchable (A GUI for OCRmyPDF)☆50Updated last year
- Datasette plugin for uploading CSV files and converting them to database tables☆27Updated 2 months ago
- Extract the list of results from search engine pages as CSV with a bookmarklet, directly within the browser☆29Updated this week
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from A…☆19Updated 2 months ago