maxcountryman / warc-parquet
🗄️ A simple CLI for converting WARC to Parquet.
☆110Updated 3 months ago
Alternatives and similar repositories for warc-parquet
Users that are interested in warc-parquet are comparing it to the libraries listed below
Sorting:
- Scale to zero Seafowl hosting with Cloud Run☆38Updated last year
- What if an HNSW index was just a file, and you could serve it from a CDN, and search it directly in the browser?☆98Updated last month
- ☆109Updated last week
- ZSV Utility for converting json to/from zip-separated-values☆56Updated 11 months ago
- Gavin Mendel-Gleason's blog☆89Updated last year
- A small language that compiles to WebAssembly Text format☆74Updated last year
- Multi-model transactional embedded database☆68Updated 5 months ago
- A simple performant GeoIP server written in Rust using MaxMind DBs with auto database update☆50Updated last week
- Code to accompany blog post https://reorchestrate.com/posts/sqlite-transactions☆65Updated 9 months ago
- webidx is a client-side search engine for static websites.☆60Updated 2 months ago
- Beating the `bisect` module's implementation using C-extensions.☆30Updated last year
- Testing various image matching algorithms' performance on the Pinecone vector DB☆43Updated last year
- ☆163Updated 11 months ago
- SQL transformation tool for DuckDB written in Rust☆50Updated 2 months ago
- Minimalist log collector☆115Updated 3 months ago
- Zig library for HyperLogLog estimation☆89Updated 9 months ago
- A safe, stateful rules language for event streams☆114Updated last year
- Hand-written email client on a reMarkable or tablet of your choice☆60Updated 3 months ago
- Fast similarity search using DuckDB☆132Updated 6 months ago
- SQL Language server and cli☆85Updated 2 months ago
- A library for parsing and executing Excel-style formulas☆58Updated 2 years ago
- ayb makes it easy to create databases, share them with collaborators, and query them from a web application or the command line☆73Updated this week
- abuse ImageMagick (or GraphicsMagick) to create arbitrary files☆53Updated last month
- the fastest CSV SQLite extension, written in Rust☆133Updated 3 months ago
- Module Oriented Large Archive Specialized Slow Exhaustive Searcher☆113Updated 9 years ago
- a file transfer service utilizing quic☆65Updated 4 months ago
- Uniform eXchange Format (uxf) is a plain text human readable optionally typed storage format that supports custom types. It may serve as …☆1Updated last year
- WarcDB: Web crawl data as SQLite databases.☆398Updated 10 months ago
- Shell scripting for serverless☆140Updated 2 years ago
- Parallelism and preemptive concurrency for sporadic workloads☆46Updated 5 months ago