jfilter / pdf-scripts
📑 Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs
☆63Updated 8 months ago
Alternatives and similar repositories for pdf-scripts:
Users that are interested in pdf-scripts are comparing it to the libraries listed below
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆14Updated last year
- Export your Github activity: events, repositories, stars, etc.☆48Updated last year
- Encapsulate dom-anchor-text-quote and dom-anchor-text-position for use in browser scripts☆10Updated 3 years ago
- Easily display Zotero items on a webpage☆32Updated last year
- Fetch all your bookmarked tweets and make them accessible through a webinterface.☆29Updated last year
- Exports all accessible reddit comments for an account using pushshift☆11Updated 2 months ago
- Create Robust Links from within Zotero☆17Updated 2 years ago
- This is the HeadQuarters of my digital info. HPI library got me inspired and I'm trying to play with the idea on a smaller scale for myse…☆20Updated last year
- Named-Entity Recognition extension for OpenRefine☆26Updated 2 years ago
- ☆13Updated 3 years ago
- Swift scripts for PDF manipulation, for Shortcuts or Terminal☆12Updated 2 years ago
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from A…☆16Updated 3 months ago
- A plugin for importing data from Wikidata into your Obsidian vault.☆29Updated 6 months ago
- A social media open post web archiving tool☆25Updated last month
- Eine kuratierte Liste hilfreicher Informationen zu Offenen Daten☆19Updated 2 years ago
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page…☆38Updated 4 months ago
- Comparing warc files☆16Updated 5 years ago
- Abbreviations for use with the Abbreviation Filter developed for use with Multilingual Zotero.☆18Updated last year
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.☆23Updated 11 months ago
- Typademic turns distraction freely written markdown files into beautiful PDFs.☆22Updated 2 years ago
- Visualisation of browsing history patterns using pandas and seaborn☆10Updated 4 years ago
- 📚 Online archive for annual reports of the German internal intelligence☆11Updated 2 months ago
- Datasette plugin for uploading CSV files and converting them to database tables☆25Updated 9 months ago
- List of awesome tools / plugins built around archivy☆27Updated 3 years ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆37Updated this week
- A Python library for defining rule-based overrides on messy data☆13Updated 2 months ago
- A Python scraping module, that extracts text from articles found in RSS feeds. Uses SQLite as database.☆18Updated 6 months ago
- Export your personal Spotify data: playlists, saved tracks/albums/shows, etc. as JSON☆37Updated last year
- Homebrew formula for the ArchiveBox self-hosted internet archiving solution.☆27Updated 3 months ago