zytedata / web-snap
Create "perfect" snapshots of web pages
☆30Updated last month
Related projects ⓘ
Alternatives and complementary repositories for web-snap
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Arch…☆15Updated 9 months ago
- 🛡️📧 Protect e-mails against spam and scraping bots☆29Updated last year
- Local SMTP desktop app for debugging and previewing your emails☆14Updated 9 months ago
- Coldbrew is Python compiled into JavaScript using Emscripten.☆30Updated last year
- Awesome list dedicated to digital and data preservation tools, sources, services and so on.☆20Updated 2 years ago
- ☆14Updated last month
- ArchiveBoxMatic: configure ArchiveBox with the simplicity of a yaml file.☆13Updated 3 years ago
- A debian:buster-slim full-text-rss Docker Container☆13Updated 3 years ago
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.☆13Updated last month
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆50Updated 3 months ago
- This is the HeadQuarters of my digital info. HPI library got me inspired and I'm trying to play with the idea on a smaller scale for myse…☆19Updated last year
- Track changes to GraphQL APIs by git scraping their schemas☆23Updated 2 weeks ago
- A framework for quick web archiving; canonical repository: https://gitea.arpa.li/JustAnotherArchivist/qwarc☆27Updated 3 years ago
- A code editing & sharing utility☆12Updated 11 months ago
- A micro-framework for asynchronous deep crawls and web scraping with Python☆13Updated last year
- https://mimesniff.spec.whatwg.org/ implementation for Python☆14Updated 10 months ago
- Encode/decode binary data over a live streaming video in real time.☆13Updated last year
- Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing…☆41Updated this week
- ☆16Updated this week
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives☆13Updated 3 years ago
- RFSH: Run shell scripts in batch, concurrently, fully customized with variable .☆23Updated 11 months ago
- NPM package and CLI tool for saving web page as single HTML file☆43Updated this week
- A server code for serving BERT-based models for text classification. It is designed by SerpApi for heavy-load prototyping and production …☆13Updated 7 months ago
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from A…☆15Updated last month
- YaBSON is a library allowing schemaless binary-encoded parsing/serialization of JavaScript data with a generator-based implementation☆13Updated last year
- Use markdown as document (by casual-markdown parser)☆13Updated last year
- A webpage bookmarking and snapshotting service☆27Updated this week
- Add your configs for tmux☆12Updated 2 years ago
- Command line tool to convert a file in the WARC format to a file in the ZIM format☆45Updated last week
- Export/access your Rescuetime data☆11Updated last year