WarcDB: Web crawl data as SQLite databases.
β404Jul 13, 2024Updated last year
Alternatives and similar repositories for WarcDB
Users that are interested in WarcDB are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ποΈ A simple CLI for converting WARC to Parquet.β114Feb 12, 2025Updated last year
- A SQLite extension for querying, manipulating, and creating HTML elements.β397Aug 6, 2023Updated 2 years ago
- π§© Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser enβ¦β19Jul 11, 2025Updated 9 months ago
- abuse ImageMagick (or GraphicsMagick) to create arbitrary filesβ54Feb 3, 2026Updated 2 months ago
- Command line tool for digging into WARC filesβ51Updated this week
- Wordpress hosting with auto-scaling - Free Trial β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Query Excel spredsheets (.xlsx, .xls, .ods) using SQLiteβ1,303Mar 9, 2025Updated last year
- A platform for collecting, analyzing, and visualizing social media data.β13Dec 27, 2020Updated 5 years ago
- Create and edit WARC and WACZ filesβ25Dec 6, 2024Updated last year
- Web archive index server based on RocksDBβ41Apr 1, 2026Updated 2 weeks ago
- Specifications developed and maintained by the Webrecorder community.β141Oct 16, 2025Updated 6 months ago
- The ultimate set of SQLite extensionsβ4,307Mar 30, 2026Updated 2 weeks ago
- β60Apr 11, 2024Updated 2 years ago
- A SQLite extension for making HTTP requests purely in SQLβ262Feb 9, 2025Updated last year
- The best RSS Search experience you can findβ618Jan 19, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A Memento Aggregator CLI and Server in Goβ78Apr 9, 2026Updated last week
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archivesβ16Jun 10, 2021Updated 4 years ago
- β17Oct 2, 2025Updated 6 months ago
- pystitcher stitches your PDF files together, generating nice customizable bookmarks for you using a declarative markdown file as inputβ397Apr 25, 2025Updated 11 months ago
- A SQLite extension that brings column-oriented tables to SQLiteβ683Mar 21, 2024Updated 2 years ago
- Python WSGI Middleware for adding HTTP/S proxy support to any WSGI Applicationβ24Oct 27, 2020Updated 5 years ago
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Archβ¦β19Feb 2, 2024Updated 2 years ago
- DataHen Till is a companion tool to your existing web scraper that instantly makes it scalable, maintainable, and more unblockable, with β¦β814Dec 5, 2021Updated 4 years ago
- Static Site Generator for Viewing Web Archives (in WACZ) formatβ30Jun 30, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- App to easily query, script, and visualize data from every database, file, and API.β2,961Nov 10, 2023Updated 2 years ago
- CG/SQL is a compiler that converts a SQL Stored Procedure like language into C for SQLite. SQLite has no stored procedures of its own. β¦β403May 1, 2023Updated 2 years ago
- A SQLite extension for reading large files line-by-line (NDJSON, logs, txt, etc.)β403Oct 7, 2023Updated 2 years ago
- A Rust library for reading and writing WARC filesβ59Nov 27, 2024Updated last year
- This is a simple graph database in SQLite, inspired by "SQLite as a document database"β1,512Feb 15, 2025Updated last year
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.β42Nov 24, 2025Updated 4 months ago
- Go sqlite3 http vfs: query sqlite databases over http with range headersβ233Apr 9, 2023Updated 3 years ago
- Query SQLite files in S3 using s3fsβ513Sep 14, 2022Updated 3 years ago
- Add website scraping abilities to Datasetteβ66Mar 4, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Convert HTTP Archive (HAR) -> Web Archive (WARC) formatβ56Oct 21, 2018Updated 7 years ago
- Replicate postgres to SQLite on the edgeβ1,036Jun 17, 2024Updated last year
- A Github Action for turning Markdown into ReSpec HTMLβ15Jun 6, 2024Updated last year
- Python CLI utility and library for manipulating SQLite databasesβ2,035Jan 21, 2026Updated 2 months ago
- Transparent dictionary-based row-level compression for SQLiteβ1,663Jun 30, 2025Updated 9 months ago
- Distributed, MVCC SQLite that runs on FoundationDB.β1,537Updated this week
- Webrecorder Automated In-Page Behavior Frameworkβ13Apr 21, 2021Updated 4 years ago