dkl3 / py-issuu-scrape
Issuu scraper written in Python.
☆16Updated 5 years ago
Related projects
Alternatives and complementary repositories for py-issuu-scrape
- Scrapes and archives a Yahoo group's email archives, photo galleries and file contents using the non-public API☆93Updated 4 years ago
- Converts a Yahoo group archive created by yahoo-group-archiver into standalone email, mbox folders, and PDF files☆22Updated 3 years ago
- A simple Python script that archives all the messages from a public Yahoo Group☆58Updated 5 years ago
- NYPL Project to transcribe and parse pages from the US Catalog of Copyright Entries☆58Updated 2 years ago
- Tab-delimited versions of Catalog of Copyright Entries renewals☆29Updated 5 years ago
- A Python script to back up the contents of private Yahoo! groups.☆37Updated 4 years ago
- utility to construct PDF files from one or more image files☆20Updated 10 months ago
- Python tools for processing data from the Catalog of Copyright Entries☆37Updated 5 years ago
- Download PDFs from academia dot edu without logging in☆54Updated 5 months ago
- Converts WARC files to static HTML☆39Updated 4 months ago
- A tool for analyzing the word histories of a text.☆34Updated 3 months ago
- Pages repo☆88Updated 2 years ago
- Raspberry Pi image for controlling a DIYBookScanner via spreads☆37Updated 9 years ago
- Command line tool to convert a file in the WARC format to a file in the ZIM format☆44Updated this week
- Recover lost websites from the Web Infrastructure☆85Updated 3 years ago
- Fast PDF generation and compression. Deals with millions of pages daily.☆100Updated 2 months ago
- A font set based on the 10th-century Exeter Book script☆29Updated 2 years ago
- A command line tool to archive a git repository from GitHub to the Internet Archive.☆90Updated 3 years ago
- Some unsupported 'wrapper' scripts for pdfjam☆46Updated 5 months ago
- An unofficial user guide for the KryoFlux written by archivists, for archivists☆92Updated last year
- The Ambitious Plan to Put a Solar Panel on a Laptop by 2030☆7Updated this week
- multispectral monitoring of a sourdough starter; esp32 eink module, scd30 co2 sensor, vl6180 distance sensor☆16Updated 2 years ago
- Script to process ham radio exam questions from http://ncvec.org into Anki flash cards. Using Python and the strategy pattern.☆27Updated 9 months ago
- The Internet Archive Research Assistant - Daily search Internet Archive for new items matching your keywords☆71Updated 7 months ago
- Scan Tailor Experimental is an interactive post-processing tool for scanned pages.☆39Updated last week
- Illustrations☆25Updated last year
- Tools for working with Ancestry.com Gedcom files and all associated media items☆37Updated last year
- Small and simple command line tool to automate downloading pages from HathiTrust☆36Updated this week