Intsights / PyWordSegment
Concatenated-word segmentation Python library written in Rust
☆16Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for PyWordSegment
- Intsights open-source wrappers library for some AWS resources and high level management objects for distributed backend systems☆17Updated last year
- Python library for a duplicate lines removal written in C++☆32Updated 8 months ago
- Python library for fast fuzzy search over a big file written in Rust☆43Updated 8 months ago
- ☆25Updated 5 months ago
- Python library for fast substring/pattern search written in C++ leveraging Suffix Array Algorithm☆41Updated 8 months ago
- A blazingly fast domain extraction library written in Rust☆65Updated 3 months ago
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices☆14Updated 3 months ago
- A Git Repository Secrets Scanner written in Rust☆39Updated 8 months ago
- Word frequency checker based on Wikipedia corpus written in Rust☆10Updated 8 months ago
- Queue server base on RocksDB as a KV-Store backend and gRPC as an interface☆10Updated last year
- A Python based alternative to Elasticsearch Reindex API with multiprocessing support.☆17Updated 8 months ago
- A fast and easy adblockplus parser and matcher based on adblock-rust package☆27Updated 8 months ago
- Fast, Safe & Simple Asynchronous Task Queues Written In Pure Python☆144Updated 2 weeks ago
- Cython based high performance alternative to Python (re) module for doing basic pattern matching on large data-set..☆12Updated last year
- HeBERT: Pre-training BERT for modern Hebrew☆72Updated last year
- A fully customisable language detection pipeline for spaCy☆93Updated 5 years ago
- Sample of replacing a function with a Rust implementation.☆9Updated last year
- Multi-Langauge Identification☆28Updated 3 months ago
- A high performance asynchronous Python client for Memcached with full batteries included☆40Updated 2 months ago
- The fastest FlashText library for Python☆20Updated 4 months ago
- Abydos NLP/IR library for Python☆183Updated 2 years ago
- Extend typing package functionalities☆60Updated last year
- Hunspell extension for spaCy 2.0.☆94Updated 3 months ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆102Updated 2 weeks ago
- ☆47Updated 2 years ago
- Find strings/words in text; convenience and C speed☆126Updated 2 years ago
- Python function to stream unzip all the files in a ZIP archive on the fly☆279Updated this week
- Annotation tool on Jupyter for Named Entity Recognition tasks☆21Updated 8 months ago
- Yet Another (natural language) Parser☆82Updated 2 years ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆122Updated last week