WarcDB: Web crawl data as SQLite databases.
β405Jul 13, 2024Updated last year
Alternatives and similar repositories for WarcDB
Users that are interested in WarcDB are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ποΈ A simple CLI for converting WARC to Parquet.β116Feb 12, 2025Updated last year
- A SQLite extension for querying, manipulating, and creating HTML elements.β397Aug 6, 2023Updated 2 years ago
- π§© Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser enβ¦β19Jul 11, 2025Updated 10 months ago
- abuse ImageMagick (or GraphicsMagick) to create arbitrary filesβ54Feb 3, 2026Updated 3 months ago
- Command line tool for digging into WARC filesβ49May 23, 2026Updated last week
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Query Excel spredsheets (.xlsx, .xls, .ods) using SQLiteβ1,303Mar 9, 2025Updated last year
- A platform for collecting, analyzing, and visualizing social media data.β13Dec 27, 2020Updated 5 years ago
- Create and edit WARC and WACZ filesβ27Dec 6, 2024Updated last year
- Web archive index server based on RocksDBβ43May 1, 2026Updated 3 weeks ago
- Specifications developed and maintained by the Webrecorder community.β139Oct 16, 2025Updated 7 months ago
- The ultimate set of SQLite extensionsβ4,330Mar 30, 2026Updated last month
- β59Apr 11, 2024Updated 2 years ago
- A SQLite extension for making HTTP requests purely in SQLβ262Feb 9, 2025Updated last year
- The best RSS Search experience you can findβ618Jan 19, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A Memento Aggregator CLI and Server in Goβ79Apr 9, 2026Updated last month
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archivesβ16Jun 10, 2021Updated 4 years ago
- β16Apr 16, 2026Updated last month
- pystitcher stitches your PDF files together, generating nice customizable bookmarks for you using a declarative markdown file as inputβ397Apr 25, 2025Updated last year
- A SQLite extension that brings column-oriented tables to SQLiteβ683Mar 21, 2024Updated 2 years ago
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Archβ¦β20Feb 2, 2024Updated 2 years ago
- DataHen Till is a companion tool to your existing web scraper that instantly makes it scalable, maintainable, and more unblockable, with β¦β815Dec 5, 2021Updated 4 years ago
- Static Site Generator for Viewing Web Archives (in WACZ) formatβ28Jun 30, 2023Updated 2 years ago
- App to easily query, script, and visualize data from every database, file, and API.β2,960Nov 10, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- CG/SQL is a compiler that converts a SQL Stored Procedure like language into C for SQLite. SQLite has no stored procedures of its own. β¦β401May 1, 2023Updated 3 years ago
- FUSE-based file system for replicating SQLite databases across a cluster of machinesβ4,775May 11, 2026Updated 2 weeks ago
- A SQLite extension for reading large files line-by-line (NDJSON, logs, txt, etc.)β402Oct 7, 2023Updated 2 years ago
- A Rust library for reading and writing WARC filesβ59Nov 27, 2024Updated last year
- This is a simple graph database in SQLite, inspired by "SQLite as a document database"β1,520Feb 15, 2025Updated last year
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.β43Nov 24, 2025Updated 6 months ago
- Go sqlite3 http vfs: query sqlite databases over http with range headersβ234Apr 9, 2023Updated 3 years ago
- Query SQLite files in S3 using s3fsβ512Sep 14, 2022Updated 3 years ago
- Add website scraping abilities to Datasetteβ66Mar 4, 2023Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- β26Oct 6, 2021Updated 4 years ago
- Convert HTTP Archive (HAR) -> Web Archive (WARC) formatβ55Oct 21, 2018Updated 7 years ago
- Replicate postgres to SQLite on the edgeβ1,038Jun 17, 2024Updated last year
- A Github Action for turning Markdown into ReSpec HTMLβ16Jun 6, 2024Updated last year
- Python CLI utility and library for manipulating SQLite databasesβ2,057May 17, 2026Updated last week
- Transparent dictionary-based row-level compression for SQLiteβ1,668Jun 30, 2025Updated 11 months ago
- Distributed, MVCC SQLite that runs on FoundationDB.β1,544May 18, 2026Updated last week