WarcDB: Web crawl data as SQLite databases.
★404 · Jul 13, 2024 · Updated last year
Alternatives and similar repositories for WarcDB
Users that are interested in WarcDB are comparing it to the libraries listed below.
- A simple CLI for converting WARC to Parquet. ★115 · Feb 12, 2025 · Updated last year
- A SQLite extension for querying, manipulating, and creating HTML elements. ★397 · Aug 6, 2023 · Updated 2 years ago
- 🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en… ★19 · Jul 11, 2025 · Updated 9 months ago
- abuse ImageMagick (or GraphicsMagick) to create arbitrary files ★54 · Feb 3, 2026 · Updated 3 months ago
- Query Excel spreadsheets (.xlsx, .xls, .ods) using SQLite ★1,303 · Mar 9, 2025 · Updated last year
- Create and edit WARC and WACZ files ★25 · Dec 6, 2024 · Updated last year
- Web archive index server based on RocksDB ★43 · May 1, 2026 · Updated last week
- Specifications developed and maintained by the Webrecorder community. ★139 · Oct 16, 2025 · Updated 6 months ago
- The ultimate set of SQLite extensions ★4,316 · Mar 30, 2026 · Updated last month
- ★58 · Apr 11, 2024 · Updated 2 years ago
- A SQLite extension for making HTTP requests purely in SQL ★262 · Feb 9, 2025 · Updated last year
- The best RSS Search experience you can find ★618 · Jan 19, 2023 · Updated 3 years ago
- A Memento Aggregator CLI and Server in Go ★78 · Apr 9, 2026 · Updated last month
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives ★16 · Jun 10, 2021 · Updated 4 years ago
- ★16 · Apr 16, 2026 · Updated 3 weeks ago
- pystitcher stitches your PDF files together, generating nice customizable bookmarks for you using a declarative markdown file as input ★397 · Apr 25, 2025 · Updated last year
- A SQLite extension that brings column-oriented tables to SQLite ★682 · Mar 21, 2024 · Updated 2 years ago
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Arch… ★20 · Feb 2, 2024 · Updated 2 years ago
- DataHen Till is a companion tool to your existing web scraper that instantly makes it scalable, maintainable, and more unblockable, with … ★815 · Dec 5, 2021 · Updated 4 years ago
- Static Site Generator for Viewing Web Archives (in WACZ) format ★28 · Jun 30, 2023 · Updated 2 years ago
- App to easily query, script, and visualize data from every database, file, and API. ★2,960 · Nov 10, 2023 · Updated 2 years ago
- CG/SQL is a compiler that converts a SQL Stored Procedure like language into C for SQLite. SQLite has no stored procedures of its own. … ★401 · May 1, 2023 · Updated 3 years ago
- FUSE-based file system for replicating SQLite databases across a cluster of machines ★4,760 · Apr 23, 2026 · Updated 2 weeks ago
- A SQLite extension for reading large files line-by-line (NDJSON, logs, txt, etc.) ★403 · Oct 7, 2023 · Updated 2 years ago
- This is a simple graph database in SQLite, inspired by "SQLite as a document database" ★1,517 · Feb 15, 2025 · Updated last year
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages. ★43 · Nov 24, 2025 · Updated 5 months ago
- Go sqlite3 http vfs: query sqlite databases over http with range headers ★233 · Apr 9, 2023 · Updated 3 years ago
- Query SQLite files in S3 using s3fs ★513 · Sep 14, 2022 · Updated 3 years ago
- Add website scraping abilities to Datasette ★66 · Mar 4, 2023 · Updated 3 years ago
- ★26 · Oct 6, 2021 · Updated 4 years ago
- Convert HTTP Archive (HAR) -> Web Archive (WARC) format ★55 · Oct 21, 2018 · Updated 7 years ago
- Replicate postgres to SQLite on the edge ★1,036 · Jun 17, 2024 · Updated last year
- A GitHub Action for turning Markdown into ReSpec HTML ★16 · Jun 6, 2024 · Updated last year
- Python CLI utility and library for manipulating SQLite databases ★2,047 · Jan 21, 2026 · Updated 3 months ago
- Transparent dictionary-based row-level compression for SQLite ★1,668 · Jun 30, 2025 · Updated 10 months ago
- Distributed, MVCC SQLite that runs on FoundationDB. ★1,540 · May 3, 2026 · Updated last week
- Postgres Generic Clever Scanning Data Verify/Recovery Xpress ★18 · Nov 23, 2019 · Updated 6 years ago
- Webrecorder Automated In-Page Behavior Framework ★13 · Apr 21, 2021 · Updated 5 years ago
- Use SQL to instantly query spreadsheets, sheets, and cell data from Google Sheets. Open source CLI. No DB required. ★31 · Apr 10, 2026 · Updated 3 weeks ago