maxcountryman / warc-parquet
ποΈ A simple CLI for converting WARC to Parquet.
β109Updated last month
Alternatives and similar repositories for warc-parquet:
Users that are interested in warc-parquet are comparing it to the libraries listed below
- β108Updated 8 months ago
- Scale to zero Seafowl hosting with Cloud Runβ38Updated last year
- ZSV Utility for converting json to/from zip-separated-valuesβ56Updated 9 months ago
- Multi-model transactional embedded databaseβ68Updated 3 months ago
- A safe, stateful rules language for event streamsβ114Updated last year
- What if an HNSW index was just a file, and you could serve it from a CDN, and search it directly in the browser?β96Updated 10 months ago
- SQL transformation tool for DuckDB written in Rustβ45Updated 2 weeks ago
- Shell scripting for serverlessβ141Updated 2 years ago
- Testing various image matching algorithms' performance on the Pinecone vector DBβ43Updated last year
- the fastest CSV SQLite extension, written in Rustβ132Updated last month
- Fast similarity search using DuckDBβ127Updated 5 months ago
- Reverse Geocode for OpenStreetmapβ122Updated 6 months ago
- Code to accompany blog post https://reorchestrate.com/posts/sqlite-transactionsβ66Updated 8 months ago
- CLI tool to convert a natural language date/time string to UTCβ237Updated 11 months ago
- Foundation DB Query Languageβ141Updated last month
- abuse ImageMagick (or GraphicsMagick) to create arbitrary filesβ53Updated this week
- WarcDB: Web crawl data as SQLite databases.β398Updated 8 months ago
- Steampipe SQLite is a zero-ETL engine for SQLite. Virtual tables translate queries into live API calls for cloud services and APIs. Hundrβ¦β55Updated last month
- A js library to incorporate HN comments to any websiteβ32Updated 10 months ago
- ayb makes it easy to create databases, share them with collaborators, and query them from a web application or the command lineβ72Updated this week
- Module Oriented Large Archive Specialized Slow Exhaustive Searcherβ113Updated 9 years ago
- Text Synchronization libraries over Braid-HTTPβ61Updated last month
- A small language that compiles to WebAssembly Text formatβ74Updated 11 months ago
- Documentation and demonstration of how to build WASM versions of SQLite with extensions embeddedβ26Updated 4 months ago
- a file transfer service utilizing quicβ65Updated 3 months ago
- Beating the `bisect` module's implementation using C-extensions.β30Updated last year
- SQLite3 extension for read-only HTTP(S) database accessβ52Updated last year
- Finds the school district associated with a given street address in the United Statesβ48Updated 7 months ago
- β163Updated 10 months ago
- Minimalist log collectorβ114Updated 2 months ago