maxcountryman / warc-parquet
ποΈ A simple CLI for converting WARC to Parquet.
β106Updated last week
Related projects β
Alternatives and complementary repositories for warc-parquet
- Scale to zero Seafowl hosting with Cloud Runβ39Updated last year
- β109Updated 4 months ago
- Gavin Mendel-Gleason's blogβ86Updated 10 months ago
- What if an HNSW index was just a file, and you could serve it from a CDN, and search it directly in the browser?β86Updated 6 months ago
- A small language that compiles to WebAssembly Text formatβ74Updated 6 months ago
- ZSV Utility for converting json to/from zip-separated-valuesβ58Updated 5 months ago
- Block Erasure Format - An extensible, fast, and usable file utility to encode and decode interleaved erasure coded streams of data.β57Updated 6 months ago
- Testing various image matching algorithms' performance on the Pinecone vector DBβ43Updated last year
- Module Oriented Large Archive Specialized Slow Exhaustive Searcherβ113Updated 9 years ago
- Shell scripting for serverlessβ141Updated 2 years ago
- A simple performant GeoIP server written in Rust using MaxMind DBs with auto database updateβ45Updated this week
- WarcDB: Web crawl data as SQLite databases.β394Updated 4 months ago
- Beating the `bisect` module's implementation using C-extensions.β30Updated last year
- β162Updated 5 months ago
- the fastest CSV SQLite extension, written in Rustβ122Updated last year
- Retry a command with exponential backoff and jitter (+ Starlark expressions)β119Updated this week
- Quality News - Towards a fairer ranking formula for Hacker Newsβ52Updated this week
- A js library to incorporate HN comments to any websiteβ31Updated 6 months ago
- Code to accompany blog post https://reorchestrate.com/posts/sqlite-transactionsβ67Updated 4 months ago
- A Go program to split large JSON files into many jsonl filesβ60Updated last year
- Command-line tool to remotely execute code in the cloudβ134Updated 2 years ago
- β44Updated 2 years ago
- Dumfederated gRPC social network implemented in Rust/Tonic/Diesel with both Flutter and React (web+native) frontends. ππ©EZ to deploy toβ¦β63Updated last week
- A safe, stateful rules language for event streamsβ113Updated last year
- Gutenberg the π© out of it!β55Updated last month
- Compile Justfiles to portable shell scriptsβ141Updated last week
- abuse ImageMagick (or GraphicsMagick) to create arbitrary filesβ53Updated last week
- SQLite3 extension for read-only HTTP(S) database accessβ51Updated last year
- a file transfer service utilizing quicβ61Updated 2 months ago