dkl3 / py-issuu-scrapeLinks
Issuu scraper written in Python.
☆16Updated 5 years ago
Alternatives and similar repositories for py-issuu-scrape
Users that are interested in py-issuu-scrape are comparing it to the libraries listed below
Sorting:
- Scrapes and archives a Yahoo groups email archives, photo galleries and file contents using the non-public API☆94Updated 5 years ago
- ☆45Updated last year
- An online citation generator for Wikipedia☆31Updated 2 months ago
- Converts WARC files to static HTML☆44Updated 11 months ago
- A collection of tools for researchers using Newspapers.com, including configurable automatic citation generation in five different format…☆11Updated 11 months ago
- Wikipedia userscript that helps assess pages for WikiProjects☆12Updated 7 months ago
- NYPL Project to transcribe and parse pages from the US Catalog of Copyright Entries☆58Updated 2 years ago
- Converts a Yahoo group archive created by yahoo-group-archiver into standalone email, mbox folders, and PDF files☆22Updated 3 years ago
- Illustrations☆26Updated last year
- A simple Python script that archives all the messages from a public Yahoo Group☆59Updated 5 years ago
- Fast PDF generation and compression. Deals with millions of pages daily.☆115Updated 9 months ago
- Pages repo☆88Updated 3 years ago
- A python script to backup the contents of private Yahoo! groups.☆37Updated 5 years ago
- GNU/Linux Overdrive/EMusic Client☆30Updated last year
- Photobucket image and album extractor. Improved version of PB_Shovel by Daxda, to scrape urls.☆11Updated 6 years ago
- Tab-delimited versions of Catalog of Copyright Entries renewals☆28Updated 6 years ago
- Docbook edition and translations of the book Free Culture by Lawrence Lessig☆22Updated 4 years ago
- Python tools for processing data from the Catalog of Copyright Entries☆37Updated 5 years ago
- Downloads an entire Internet Archive collection☆32Updated 6 years ago
- A Collection of Christmas Carols☆36Updated this week
- Raspberry Pi image for controlling a DIYBookScanner via spreads☆37Updated 9 years ago
- A design system for interactive fiction based on natural language.☆107Updated last month
- Python API for neocities.org☆75Updated 9 months ago
- Modular workflow assistant for book digitization☆18Updated 8 years ago
- A command line tool to archive a git repository from GitHub to the Internet Archive.☆94Updated 4 years ago
- A web tool to convert Wiki tables to CSV 📈☆181Updated 5 months ago
- Nondestructive warc-in-tar to warc conversion☆26Updated 12 years ago
- Tool and library for handling Web ARChive (WARC) files.☆159Updated 7 months ago
- A library for generating slide rules using Haskell and `diagrams`.☆63Updated last year
- NaNoGenMo☆36Updated 8 months ago