newca12 / dictionary-builderLinks
Real world example to demonstrate advanced techniques to unmarshall very large xml document with very low memory footprint.
☆61Updated 10 months ago
Alternatives and similar repositories for dictionary-builder
Users that are interested in dictionary-builder are comparing it to the libraries listed below
Sorting:
- ☆51Updated 3 years ago
- A blazingly fast phonetic reduction/hashing algorithm.☆219Updated 4 years ago
- Pure Rust port of CRFsuite: a fast implementation of Conditional Random Fields (CRFs)☆29Updated last week
- Spelling correction & Fuzzy search based on Symmetric Delete spelling correction algorithm.☆141Updated 7 months ago
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot☆13Updated 5 years ago
- Multilingual implementation of RAKE algorithm for Rust☆36Updated 11 months ago
- Context-sensitive word embeddings with subwords. In Rust.☆90Updated 2 years ago
- Fast English word segmentation in Rust☆102Updated 3 weeks ago
- Helsinki Finite-State Technology (library and application suite)☆136Updated last month
- A Rust library for reading and writing WARC files☆59Updated last year
- Archived Python/Rust hybrid codebase - see divvun/kbdgen for v3☆26Updated 4 years ago
- ☆68Updated 2 years ago
- Port of arc90labs-readability with rust☆131Updated last year
- Rust wrapper for libxml2☆87Updated 2 months ago
- Java Wiktionary Library☆59Updated 3 years ago
- Neural syntax annotator, supporting sequence labeling, lemmatization, and dependency parsing.☆80Updated 2 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆107Updated 2 months ago
- Rust crate for entity parsing☆18Updated 3 years ago
- Text hyphenation for Rust☆57Updated 2 years ago
- finalfusion embeddings in Rust☆105Updated 2 years ago
- ☆380Updated 5 months ago
- an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction (mirror of https://…☆37Updated last month
- rust version of chardet☆36Updated 7 years ago
- Rust Regex binding for Javascript☆30Updated this week
- Lexical data at Unicode☆70Updated last year
- Full-text IPFS-friendly and WASM-compatible Search in Rust☆288Updated 8 months ago
- The next iteration of a Rust keyboard layout generator☆23Updated this week
- The code, training pipeline, and models that power Firefox Translations☆244Updated this week
- XPath, XQuery, and XSLT for Rust☆134Updated last week
- A rust implementation of some popular snowball stemming algorithms☆130Updated last year