WarcDB: Web crawl data as SQLite databases.
☆405 · Jul 13, 2024 · Updated last year
Alternatives and similar repositories for WarcDB
Users that are interested in WarcDB are comparing it to the libraries listed below.
- A simple CLI for converting WARC to Parquet. ☆113 · Feb 12, 2025 · Updated last year
- A SQLite extension for querying, manipulating, and creating HTML elements. ☆396 · Aug 6, 2023 · Updated 2 years ago
- 🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en… ☆19 · Jul 11, 2025 · Updated 8 months ago
- abuse ImageMagick (or GraphicsMagick) to create arbitrary files ☆54 · Feb 3, 2026 · Updated last month
- Command line tool for digging into WARC files ☆51 · Updated this week
- Query Excel spreadsheets (.xlsx, .xls, .ods) using SQLite ☆1,304 · Mar 9, 2025 · Updated last year
- A platform for collecting, analyzing, and visualizing social media data. ☆13 · Dec 27, 2020 · Updated 5 years ago
- Create and edit WARC and WACZ files ☆25 · Dec 6, 2024 · Updated last year
- Web archive index server based on RocksDB ☆38 · Mar 2, 2026 · Updated 3 weeks ago
- Specifications developed and maintained by the Webrecorder community. ☆140 · Oct 16, 2025 · Updated 5 months ago
- The ultimate set of SQLite extensions ☆4,290 · Feb 10, 2026 · Updated last month
- ☆60 · Apr 11, 2024 · Updated last year
- A SQLite extension for making HTTP requests purely in SQL ☆261 · Feb 9, 2025 · Updated last year
- The best RSS Search experience you can find ☆621 · Jan 19, 2023 · Updated 3 years ago
- A Memento Aggregator CLI and Server in Go ☆78 · Mar 4, 2025 · Updated last year
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives ☆16 · Jun 10, 2021 · Updated 4 years ago
- ☆17 · Oct 2, 2025 · Updated 5 months ago
- pystitcher stitches your PDF files together, generating nice customizable bookmarks for you using a declarative markdown file as input ☆397 · Apr 25, 2025 · Updated 11 months ago
- A SQLite extension that brings column-oriented tables to SQLite ☆683 · Mar 21, 2024 · Updated 2 years ago
- Python WSGI Middleware for adding HTTP/S proxy support to any WSGI Application ☆24 · Oct 27, 2020 · Updated 5 years ago
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Arch… ☆19 · Feb 2, 2024 · Updated 2 years ago
- DataHen Till is a companion tool to your existing web scraper that instantly makes it scalable, maintainable, and more unblockable, with … ☆815 · Dec 5, 2021 · Updated 4 years ago
- Static Site Generator for Viewing Web Archives (in WACZ) format ☆30 · Jun 30, 2023 · Updated 2 years ago
- App to easily query, script, and visualize data from every database, file, and API. ☆2,960 · Nov 10, 2023 · Updated 2 years ago
- CG/SQL is a compiler that converts a SQL Stored Procedure like language into C for SQLite. SQLite has no stored procedures of its own. … ☆403 · May 1, 2023 · Updated 2 years ago
- FUSE-based file system for replicating SQLite databases across a cluster of machines ☆4,721 · Apr 22, 2025 · Updated 11 months ago
- A SQLite extension for reading large files line-by-line (NDJSON, logs, txt, etc.) ☆403 · Oct 7, 2023 · Updated 2 years ago
- A Rust library for reading and writing WARC files ☆59 · Nov 27, 2024 · Updated last year
- This is a simple graph database in SQLite, inspired by "SQLite as a document database" ☆1,505 · Feb 15, 2025 · Updated last year
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages. ☆42 · Nov 24, 2025 · Updated 4 months ago
- Go sqlite3 http vfs: query sqlite databases over http with range headers ☆230 · Apr 9, 2023 · Updated 2 years ago
- Query SQLite files in S3 using s3fs ☆513 · Sep 14, 2022 · Updated 3 years ago
- Add website scraping abilities to Datasette ☆66 · Mar 4, 2023 · Updated 3 years ago
- ☆26 · Oct 6, 2021 · Updated 4 years ago
- Convert HTTP Archive (HAR) -> Web Archive (WARC) format ☆56 · Oct 21, 2018 · Updated 7 years ago
- A GitHub Action for turning Markdown into ReSpec HTML ☆15 · Jun 6, 2024 · Updated last year
- Transparent dictionary-based row-level compression for SQLite ☆1,656 · Jun 30, 2025 · Updated 9 months ago
- Distributed, MVCC SQLite that runs on FoundationDB. ☆1,532 · Updated this week
- Postgres Generic Clever Scanning Data Verify/Recovery Xpress ☆18 · Nov 23, 2019 · Updated 6 years ago