jedireza / warcLinks
A Rust library for reading and writing WARC files
☆56Updated last year
Alternatives and similar repositories for warc
Users that are interested in warc are comparing it to the libraries listed below
Sorting:
- Spelling correction & Fuzzy search based on Symmetric Delete spelling correction algorithm.☆140Updated 5 months ago
- Fast English word segmentation in Rust☆101Updated last month
- ☆67Updated 2 years ago
- Fast hierarchical agglomerative clustering in Rust.☆103Updated 8 months ago
- Fast approximate nearest neighbor searching in Rust, based on HNSW index☆337Updated 2 weeks ago
- Xor filters - efficient probabilistic hashsets. Faster and smaller than bloom and cuckoo filters.☆150Updated last week
- A vectorized JSON parser for pre-validated, minified documents☆85Updated last year
- Fast item-to-item recommendations on the command line.☆38Updated 3 years ago
- Rust implementation of Simhash☆24Updated 2 years ago
- A WHATWG-compliant HTML5 tokenizer and tag soup parser☆166Updated this week
- A collection of small notes that aren't appropriate for my blog.☆32Updated 3 years ago
- A crate implementing a synchronized map for memoization☆30Updated 10 months ago
- Media file metadata for human consumption☆58Updated 3 months ago
- Dynamic transformation of data using serde serializable, deserialize using JSON and a JSON transformation syntax similar to Javascript JS…☆16Updated 4 years ago
- Rust client for txtai☆113Updated 3 weeks ago
- [UNMAINTAINED] A transactional and deduplicating virtual file system☆97Updated last year
- A command line tool to rename media files based on titles from IMDb.☆238Updated last year
- ☆51Updated 3 years ago
- Rust helpers for conditional GET, HEAD, byte range serving, and gzip content encoding for static files and more with hyper and tokio.☆34Updated 9 months ago
- Rust implementation of JMESPath, a query language for JSON☆150Updated 5 months ago
- Native Rust port of Google's HighwayHash, which makes use of SIMD instructions for a fast and strong hash function☆173Updated 4 months ago
- Hidden Markov Models in Rust☆78Updated last year
- 🃏 A distributed unique ID generator inspired by Twitter's Snowflake.☆190Updated 2 weeks ago
- Cargo subcommand for downloading crates directly from crates.io☆28Updated 4 years ago
- Port of arc90labs-readability with rust☆132Updated last year
- Proxy for turning web browsers into web servers. Load a 100GB file in your browser and stream it over the public web with HTTP byte range…☆103Updated 2 years ago
- Rust library to find links such as URLs and email addresses in plain text, handling surrounding punctuation correctly☆225Updated 2 weeks ago
- The fastest and lightest mail parsing Rust library.☆179Updated 2 years ago
- Full-service command-line parsing☆74Updated 9 months ago
- A globbing library for Rust.☆44Updated last year