ArchiveTeam / grab-siteLinks
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
☆1,482Updated last week
Alternatives and similar repositories for grab-site
Users that are interested in grab-site are comparing it to the libraries listed below
Sorting:
- Wget-compatible web downloader and crawler.☆584Updated last year
- Collect and revisit web pages.☆1,501Updated 4 months ago
- Core Python Web Archiving Toolkit for replay and recording of web archives☆1,509Updated 3 weeks ago
- Web Archiving Integration Layer: One-Click User Instigated Preservation☆373Updated 2 months ago
- Run a high-fidelity browser-based web archiving crawler in a single Docker container☆786Updated this week
- A Python and Command-Line Interface to Archive.org☆1,704Updated last week
- ArchiveBot, an IRC bot for archiving websites☆383Updated this week
- An archiving tool with an IM-style interface that prioritizes privacy and accessibility, integrated with various archival services includ…☆1,966Updated this week
- Serverless replay of web archives directly in the browser☆796Updated this week
- Self-Hosted Bookmark And Archive Manager☆1,815Updated last year
- The ultimate collection of scripts for YouTube-DL.☆2,422Updated 8 months ago
- Wayback Machine API interface & a command-line tool☆528Updated last year
- Chrome extension to "Create WARC files from any webpage"☆220Updated last year
- Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more …☆277Updated this week
- 💾 dn - offline full-text search and archiving for your Chromium-based browser.☆3,833Updated 2 weeks ago
- The personal, minimalist, super-fast, database free, bookmarking service - community repo☆3,644Updated last month
- Indexes open directories☆1,218Updated 3 weeks ago
- Web Extension to save a faithful copy of an entire web page in a self-extracting ZIP file☆1,877Updated 8 months ago
- A curated list of awesome tools for website diffing and change monitoring.☆509Updated 2 years ago
- Download the entire Wayback Machine archive for a given URL.☆3,031Updated last month
- I consume the world via RSS feeds, and this is my attempt to keep it that way.☆796Updated last week
- Espial is an open-source, web-based bookmarking server.☆841Updated last week
- Follow blogs, wikis, YouTube channels, as well as accounts on Twitter, Instagram, etc. from a single page.☆1,797Updated last year
- Starting point for archiving entire YouTube channels using yt-dlp (originally youtube-dl)☆501Updated 2 years ago
- Webrecorder Desktop App!☆205Updated 4 years ago
- A browser extension that captures web pages to local device or backend server for future retrieval, organization, annotation, and edit. T…☆1,017Updated last week
- Client side encrypted pastebin☆1,394Updated 3 months ago
- Extremely fast tool to remove duplicates and other lint from your filesystem☆2,081Updated 3 weeks ago
- Browser extension for viewing archived and cached versions of web pages, available for Chrome, Edge and Safari☆1,323Updated 5 months ago
- Watch (parts of) webpages and get notified when something changes via e-mail, on your phone or via other means. Highly configurable.☆2,909Updated 2 months ago