palewire / archiveisLinks
A simple Python wrapper for the archive.is capturing service
☆206Updated 9 months ago
Alternatives and similar repositories for archiveis
Users that are interested in archiveis are comparing it to the libraries listed below
Sorting:
- A Tool To Push Web Resources Into Web Archives☆424Updated last year
- WARC writing MITM HTTP/S proxy☆429Updated last month
- Chrome extension to "Create WARC files from any webpage"☆224Updated last year
- Web Archiving Integration Layer: One-Click User Instigated Preservation☆384Updated 8 months ago
- Grabbing all news.☆62Updated 5 years ago
- brozzler - distributed browser-based web crawler☆760Updated this week
- A basic tool for pushing a web page to multiple archiving services at once.☆216Updated last year
- Parse OPML subscription lists in Python☆84Updated this week
- Indelible links☆489Updated this week
- Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head☆172Updated 5 years ago
- Wget-compatible web downloader and crawler.☆596Updated last year
- Python client for Firefox Send☆120Updated 2 years ago
- Estimating the age of web resources☆96Updated 5 months ago
- An OPML file with 22 of the top 25 US newspapers RSS feeds☆56Updated 7 years ago
- Recover lost websites from the Web Infrastructure☆89Updated 3 months ago
- Scraping assistant tool. Editing and maintaining CSS/XPath selectors across webpages.☆122Updated 7 years ago
- DIY Atom feeds in times of social media and paywalls☆85Updated last year
- Tool for real-time scraping of news articles.☆39Updated 6 years ago
- Save data from Twitter to a SQLite database☆412Updated 2 years ago
- Hacker News ranked by Comment/Score ratio☆28Updated last week
- Serving content from a WARC☆62Updated 12 years ago
- Export your (or other people's) Goodreads data to SQLite☆84Updated 5 years ago
- We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case.☆92Updated 5 years ago
- An archival copy.☆81Updated 12 years ago
- Convert URL or RSS feed to text with readability☆51Updated 5 years ago
- The simple way of using Imgur.☆126Updated 7 months ago
- The subreddit archiver☆177Updated 2 years ago
- Tool for keeping a hypermedia encyclopedia☆57Updated 2 years ago
- A light version of Tor portable to the browser☆120Updated 5 years ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.☆47Updated 7 years ago