Create "perfect" snapshots of web pages
☆34Mar 12, 2026Updated last week
Alternatives and similar repositories for web-snap
Users that are interested in web-snap are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Spider templates for automatic crawlers.☆34Jan 8, 2026Updated 2 months ago
- Python client for Zyte API☆29Feb 10, 2026Updated last month
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.☆12Oct 5, 2024Updated last year
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki☆28Jul 31, 2024Updated last year
- Homebrew formula for the ArchiveBox self-hosted internet archiving solution.☆28Oct 5, 2024Updated last year
- A fork of http://pydispatcher.sourceforge.net/ with PyPy support☆16Jul 3, 2017Updated 8 years ago
- ☆14Feb 11, 2023Updated 3 years ago
- A simple algorithm for clustering web pages, suitable for crawlers☆35Mar 6, 2017Updated 9 years ago
- Flatten, format, and export any JSON-like data to CSV (or any other string output).☆17Sep 13, 2021Updated 4 years ago
- HTML5 audio/video clipper☆13Mar 7, 2018Updated 8 years ago
- 🗿Stones: Persistent key-value containers, compatible with Python dict☆17Jul 15, 2024Updated last year
- Converts HTTrack crawls to WARC files☆34Aug 6, 2024Updated last year
- A puppeteer-extra plugin to solve Amazon captchas using Tessaract.JS.☆15May 16, 2024Updated last year
- ☆10Apr 22, 2024Updated last year
- A dataset of popular pages (taken from <dir.yahoo.com>) with manually marked up semantic blocks.☆15Feb 9, 2014Updated 12 years ago
- ☆10Mar 10, 2026Updated last week
- Podclips is an iOS app that allows users to cut out and share clips from their favourite podcasts☆15Mar 25, 2018Updated 7 years ago
- 🗄 Save an archived copy of websites from Pocket/Pinboard/Bookmarks/RSS. Outputs HTML, PDFs, and more...☆38Aug 12, 2018Updated 7 years ago
- 404Games Wastelands V2 - Chernarus☆25Jun 25, 2013Updated 12 years ago
- ☆14Jun 27, 2019Updated 6 years ago
- Web archive index server based on RocksDB☆38Mar 2, 2026Updated 3 weeks ago
- An ultra lightweight web screenshot tool with advanced DOM analysis features.☆41Dec 2, 2025Updated 3 months ago
- Web scraping Page Objects core library☆104Mar 10, 2026Updated last week
- A simple 404 page that uses the pathname as input to generate a 404 message.☆13Apr 28, 2018Updated 7 years ago
- Scrapy exporter for Big Data formats☆16Mar 10, 2026Updated last week
- Material for my React Fundamentals Workshop☆17Dec 27, 2022Updated 3 years ago
- Standard implementation of TRC404☆10Jan 20, 2025Updated last year
- Les réflexions menées au cours du 404CTF 2023 pour résoudre les challenges proposés☆10Dec 16, 2023Updated 2 years ago
- A Simple C++ based CSSParser☆18Jan 19, 2021Updated 5 years ago
- ☆23Dec 12, 2025Updated 3 months ago
- A polyfill for the WebCodecs API for use in server-side JavaScript environments such as Node, Deno, and Bun.☆53Dec 5, 2025Updated 3 months ago
- Repository for ru-syntax command line tool.☆16Mar 8, 2022Updated 4 years ago
- u-boot addon image for the AVM FritzBox 4040☆11Mar 8, 2026Updated 2 weeks ago
- Vite bundled dev, RSC on tanstack router, Vue server component, etc...☆20Oct 28, 2025Updated 4 months ago
- Fixed Point Math in C++ for Playstation 1☆12Aug 21, 2023Updated 2 years ago
- ☆15Feb 15, 2022Updated 4 years ago
- ☆22Feb 13, 2026Updated last month
- Gopher Signal uses smart technology to quickly summarize important points from HackerNews.com articles. https://gophersignal.com☆26Mar 9, 2026Updated 2 weeks ago
- ☆16Jan 2, 2016Updated 10 years ago