csaftoiu / yahoo-groups-backupLinks
A python script to backup the contents of private Yahoo! groups.
☆37Updated 6 years ago
Alternatives and similar repositories for yahoo-groups-backup
Users that are interested in yahoo-groups-backup are comparing it to the libraries listed below
Sorting:
- Scrapes and archives a Yahoo groups email archives, photo galleries and file contents using the non-public API☆94Updated 6 years ago
- A simple Python script that archives all the messages from a public Yahoo Group☆59Updated 6 years ago
- Converts a Yahoo group archive created by yahoo-group-archiver into standalone email, mbox folders, and PDF files☆23Updated 4 years ago
- Python tools for processing data from the Catalog of Copyright Entries☆39Updated 6 years ago
- Papercut NNTP server☆48Updated 14 years ago
- Raspberry Pi image for controlling a DIYBookScanner via spreads☆37Updated 10 years ago
- Modular workflow assistant for book digitization☆131Updated 9 years ago
- simple console/terminal podcast downloader☆60Updated last year
- Grabbing all news.☆62Updated 6 years ago
- Project for parsing Usenet mbox files into local PostgreSQL DB☆18Updated 5 years ago
- Check out https://github.com/webrecorder/webrecorder for newer version matching https://webrecorder.io☆38Updated 10 years ago
- Serving content from a WARC☆62Updated 13 years ago
- Trough: Big data, small databases.☆41Updated last year
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)☆168Updated 4 months ago
- Humane Heritage - OLD VERSION☆112Updated 5 years ago
- Official lifelines repository☆87Updated last week
- Tool and library for handling Web ARChive (WARC) files.☆165Updated last year
- Papercut NNTP server☆15Updated 9 years ago
- Recover lost websites from the Web Infrastructure☆91Updated 4 months ago
- NOTE: This project is no longer being actively developed.. Check out https://replayweb.page / https://github.com/webrecorder/replayweb.pa…☆200Updated 11 months ago
- smoothscan is a tool to convert scanned text into a vectorized output form.☆67Updated 12 years ago
- Stitches scanned image segments☆64Updated 12 years ago
- Nondestructive warc-in-tar to warc conversion☆27Updated 12 years ago
- A modern frontend to newsgroups☆22Updated 6 years ago
- A dockerized, queued high fidelity web archiver based on Squidwarc☆61Updated last year
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.☆47Updated 7 years ago
- Streamlined version of the tech in the Goodbye Big Five Series☆57Updated 5 years ago
- Short script for removing watermarks from PDF files. Requires pdftk.☆59Updated 6 years ago
- Gopher client and server library for Python☆46Updated 2 years ago
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service☆188Updated last year