jfilter / pdf-scriptsLinks
📑 Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs
☆70Updated last year
Alternatives and similar repositories for pdf-scripts
Users that are interested in pdf-scripts are comparing it to the libraries listed below
Sorting:
- Create one timeline from various digital sources☆12Updated last year
- Comparing warc files☆17Updated 6 years ago
- ReadablePDF streamlines the effort of turning a not so great PDF into a more easily readable PDF (or of course a pretty decent PDF into a…☆33Updated 4 years ago
- Tool to index and serve HTML files. Powered by Datasette.☆109Updated 3 years ago
- Auflistung Freier/Libre-Open-Source-Software, die bereits im öffentlichen Dienst genutzt oder gar selbst betrieben wird. Ergänzungen aus …☆32Updated 2 years ago
- Fast PDF generation and compression. Deals with millions of pages daily.☆125Updated 2 months ago
- A list of things related to software, literature, and other content for 🕣 Memento☆102Updated last year
- 🍨 High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.☆179Updated 2 months ago
- Some tools to help analyze the twitter archive☆64Updated 5 months ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆52Updated this week
- A social media open post web archiving tool☆27Updated 2 weeks ago
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page…☆40Updated last year
- Fine-tuning the use of youtube-dl / yt-dlp for audio and videophiles☆60Updated last year
- Export your personal Spotify data: playlists, saved tracks/albums/shows, etc. as JSON☆39Updated 3 months ago
- The ArchiveWeb.page Site☆30Updated 2 weeks ago
- A collection of curated home built packages for the cross-platform text expander Espanso☆43Updated 4 months ago
- A post-processing tool for scanned sheets of paper.☆85Updated last year
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR …☆66Updated last year
- 📚 A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivity☆98Updated 7 years ago
- Make graphs you can play with... Web app in Flask and Bootstrap to fetch Zotero datasets and then create graph visualizations with d3.js☆22Updated 7 years ago
- Custom AppleScript libraries providing a variety of utilities☆16Updated 2 years ago
- Automatic markdown backlinks. Designed with Zettlr in mind.☆36Updated 3 years ago
- A library/CLI tool to parse data out of your Google Takeout (History, Activity, Youtube, Locations, etc...)☆114Updated 2 months ago
- Export/access your Hypothes.is data: annotations and profile info☆45Updated 4 months ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac…☆55Updated 2 months ago
- Search google scholar and only return the papers published on high h-index journals☆17Updated 3 years ago
- Extract networks of entities from journalistic reporting☆48Updated 2 years ago
- Synchronize your Mastodon bookmarks to bookmarking services.☆13Updated last month
- Homebrew formula for the ArchiveBox self-hosted internet archiving solution.☆28Updated last year
- Easily parse location .json files provided by the Google Takeout service☆37Updated 8 months ago