Florents-Tselai / WarcDB
WarcDB: Web crawl data as SQLite databases.
☆398Updated 7 months ago
Alternatives and similar repositories for WarcDB:
Users that are interested in WarcDB are comparing it to the libraries listed below
- A SQLite extension for querying, manipulating, and creating HTML elements.☆381Updated last year
- A SQLite extension which loads a Google Sheet as a virtual table.☆513Updated 2 years ago
- A SQLite extension for reading large files line-by-line (NDJSON, logs, txt, etc.)☆396Updated last year
- A self-hosted live video streaming platform with Discord authentication, auto-recording and more!☆349Updated 7 months ago
- Query SQLite files in S3 using s3fs☆499Updated 2 years ago
- Python module to parse ingredient names. Splitting them into the ingredient, unit and quantity. It is trained on a publicly available dat…☆151Updated last year
- Create a SQLite database containing metadata from Google Drive☆156Updated 2 years ago
- Dirty Little SQL Notebook☆112Updated 2 years ago
- ☆114Updated 3 years ago
- 🗄️ A simple CLI for converting WARC to Parquet.☆108Updated 2 weeks ago
- DataHen Till is a companion tool to your existing web scraper that instantly makes it scalable, maintainable, and more unblockable, with …☆814Updated 3 years ago
- Minimalist Error collection Service compatible with Rollbar clients. Sentry or Rollbar alternative.☆386Updated 3 years ago
- Distributed Embeddable Database☆255Updated 2 years ago
- Module Oriented Large Archive Specialized Slow Exhaustive Searcher☆113Updated 9 years ago
- A SQLite extension for making HTTP requests purely in SQL☆242Updated 2 weeks ago
- Scrapy rotation proxy package with advanced functions☆94Updated 2 years ago
- Verneuil is a VFS extension for SQLite that asynchronously replicates databases to S3-compatible blob stores.☆473Updated 4 months ago
- Geocode rows in a SQLite database table☆234Updated 2 years ago
- A collaborative UML editor; build with etherpad and plantuml☆170Updated 9 months ago
- A generator for OpenAPI 3☆97Updated 4 years ago
- α-Indirect Control in Onion-like Networks☆149Updated last year
- Easy log aggregation, indexing and searching☆169Updated 4 months ago
- The various scripts I use to back up my home computers using ssh and rsync☆199Updated 3 years ago
- Dockerized local and offline backing up of PostgresQL with rotation and compression.☆210Updated last year
- Repository for Pipes☆269Updated 6 months ago
- Command-line tool to remotely execute code in the cloud☆134Updated 2 years ago
- Shell scripting for serverless☆141Updated 2 years ago
- Minimalist log collector☆114Updated last month
- Query sqlite via json+http☆520Updated last week
- shell-based query tool☆164Updated last year