Create "perfect" snapshots of web pages
☆34Apr 10, 2026Updated 3 weeks ago
Alternatives and similar repositories for web-snap
Users who are interested in web-snap are comparing it to the libraries listed below.
- Spider templates for automatic crawlers.☆34Mar 26, 2026Updated last month
- Python client for Zyte API☆29Apr 23, 2026Updated last week
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.☆12Oct 5, 2024Updated last year
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki☆28Jul 31, 2024Updated last year
- A fork of http://pydispatcher.sourceforge.net/ with PyPy support☆16Jul 3, 2017Updated 8 years ago
- ☆14Feb 11, 2023Updated 3 years ago
- A simple algorithm for clustering web pages, suitable for crawlers☆35Mar 6, 2017Updated 9 years ago
- Converts HTTrack crawls to WARC files☆34Aug 6, 2024Updated last year
- Flatten, format, and export any JSON-like data to CSV (or any other string output).☆17Sep 13, 2021Updated 4 years ago
- ☆11Feb 17, 2022Updated 4 years ago
- A dataset of popular pages (taken from <dir.yahoo.com>) with manually marked up semantic blocks.☆15Feb 9, 2014Updated 12 years ago
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆57Aug 15, 2024Updated last year
- My game entry for JS13K Games 2020 on the theme "404".☆35Apr 19, 2021Updated 5 years ago
- 404Games Wastelands V2 - Chernarus☆25Jun 25, 2013Updated 12 years ago
- Verifiable Credential Extensions☆12Feb 12, 2025Updated last year
- ☆14Jun 27, 2019Updated 6 years ago
- Web archive index server based on RocksDB☆43Updated this week
- Conifer setup and deployment via Ansible☆12Jun 15, 2020Updated 5 years ago
- Python clients for Zyte AutoExtract API☆41Jan 17, 2022Updated 4 years ago
- Web scraping Page Objects core library☆105Apr 21, 2026Updated last week
- ☆19Oct 6, 2025Updated 6 months ago
- A simple 404 page that uses the pathname as input to generate a 404 message.☆13Apr 28, 2018Updated 8 years ago
- Standard implementation of TRC404☆10Jan 20, 2025Updated last year
- A default backend (404 page) for nginx-ingress in Kubernetes☆13Jan 23, 2018Updated 8 years ago
- NAMM Standards☆10Dec 7, 2021Updated 4 years ago
- The Zonemaster Backend - part of the Zonemaster project☆16Dec 19, 2025Updated 4 months ago
- PlayStation GPU (WIP)☆18Oct 3, 2023Updated 2 years ago
- A C# library for loading LSD: Dream Emulator data files.☆15Aug 28, 2023Updated 2 years ago
- ☆16Sep 9, 2021Updated 4 years ago
- OmniCrawl is a web measurement tool that allows for recording of web requests and JavaScript browser API accesses on multiple platforms.☆28Mar 20, 2024Updated 2 years ago
- Repository for ru-syntax command line tool.☆16Mar 8, 2022Updated 4 years ago
- Intelligent redirector extension for 404 pages in Silverstripe☆18Apr 15, 2025Updated last year
- Scripts to automate IPv6 maintenance on RouterOS, and more☆15Jan 4, 2026Updated 3 months ago
- Extract text from HTML☆135Apr 8, 2026Updated 3 weeks ago
- Advanced JavaScript virtualization engine for code protection. KrakVM combines custom bytecode compilation with deep structural obfuscati…☆35Feb 26, 2026Updated 2 months ago
- Automatically crop scans of multiple images at once.☆14Feb 23, 2019Updated 7 years ago
- Vue: fixes the problem of vue-router's addRoutes being lost on page refresh (unmatched routes redirect to 404)☆11Jun 30, 2018Updated 7 years ago
- ☆15Feb 15, 2022Updated 4 years ago
- Firefox extension to find and remove expired bookmarks.☆17Aug 28, 2019Updated 6 years ago