Florents-Tselai / WarcDB
WarcDB: Web crawl data as SQLite databases.
☆398 · Updated 10 months ago
Alternatives and similar repositories for WarcDB
Users who are interested in WarcDB are comparing it to the libraries listed below.
- A SQLite extension for reading large files line-by-line (NDJSON, logs, txt, etc.) ☆397 · Updated last year
- A SQLite extension which loads a Google Sheet as a virtual table. ☆518 · Updated 2 years ago
- Query SQLite files in S3 using s3fs ☆504 · Updated 2 years ago
- A SQLite extension for querying, manipulating, and creating HTML elements. ☆384 · Updated last year
- A self hosted recommendation feed generated from your browsing habits ☆313 · Updated 2 years ago
- Python module to parse ingredient names. Splitting them into the ingredient, unit and quantity. It is trained on a publicly available dat… ☆153 · Updated last year
- The best RSS Search experience you can find ☆627 · Updated 2 years ago
- Functional UUIDs for Python. ☆148 · Updated 4 years ago
- Repository for Pipes ☆273 · Updated 9 months ago
- Scrapy rotation proxy package with advanced functions ☆95 · Updated 2 years ago
- Create a SQLite database containing metadata from Google Drive ☆159 · Updated 2 months ago
- 🗄️ A simple CLI for converting WARC to Parquet. ☆110 · Updated 3 months ago
- Minimalist log collector ☆115 · Updated 3 months ago
- Geocode rows in a SQLite database table ☆236 · Updated 2 years ago
- Template repository for setting up shot-scraper ☆256 · Updated 2 months ago
- A self-hosted live video streaming platform with Discord authentication, auto-recording and more! ☆349 · Updated 10 months ago
- Rsync-based time machine for Linux, written in Python, for local and remote backups. ☆169 · Updated last year
- a simple website for sharing table data - with an API ☆392 · Updated last month
- API for extracting a table from an image or a PDF ☆91 · Updated 8 months ago
- Distributed Embeddable Database ☆255 · Updated 2 years ago
- ☆114 · Updated 4 years ago
- A generator for OpenAPI 3 ☆97 · Updated 4 years ago
- The various scripts I use to back up my home computers using ssh and rsync ☆199 · Updated 3 years ago
- Minimalist Error collection Service compatible with Rollbar clients. Sentry or Rollbar alternative. ☆387 · Updated 3 years ago
- µServer in Bash ☆154 · Updated 2 years ago
- Verneuil is a VFS extension for SQLite that asynchronously replicates databases to S3-compatible blob stores. ☆494 · Updated 7 months ago
- Command line parser for common log format. ☆142 · Updated 10 months ago
- Simple Python Calculation Engine ☆134 · Updated 3 years ago
- shell-based query tool ☆164 · Updated last year
- Shell scripting for serverless ☆140 · Updated 2 years ago