Intsights / PyWordSegment
Concatenated-word segmentation Python library written in Rust
☆17Updated 3 weeks ago
Alternatives and similar repositories for PyWordSegment:
Users that are interested in PyWordSegment are comparing it to the libraries listed below
- Intsights open-source wrappers library for some AWS resources and high level management objects for distributed backend systems☆17Updated last year
- Python library for a duplicate lines removal written in C++☆33Updated 3 weeks ago
- Python library for fast substring/pattern search written in C++ leveraging Suffix Array Algorithm☆41Updated 3 weeks ago
- Python library for fast fuzzy search over a big file written in Rust☆45Updated last month
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices☆14Updated last month
- A blazingly fast domain extraction library written in Rust☆65Updated 3 weeks ago
- A fast and easy adblockplus parser and matcher based on adblock-rust package☆27Updated last month
- Word frequency checker based on Wikipedia corpus written in Rust☆10Updated 3 weeks ago
- ☆25Updated last month
- Queue server base on RocksDB as a KV-Store backend and gRPC as an interface☆10Updated last year
- A Python based alternative to Elasticsearch Reindex API with multiprocessing support.☆17Updated last month
- A Git Repository Secrets Scanner written in Rust☆39Updated 3 weeks ago
- Fast, Safe & Simple Asynchronous Task Queues Written In Pure Python☆144Updated last month
- Hebrew analyzer plugin for elasticsearch☆60Updated 5 years ago
- Fast Python Bloom Filter using Mmap☆129Updated 10 months ago
- Extract text from HTML☆135Updated 4 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆69Updated 2 months ago
- A lucene query parser generating ElasticSearch queries and more !☆189Updated last month
- Scalable String Similarity Joins in Python☆39Updated 8 months ago
- Neural Sentiment Analyzer for Modern Hebrew☆43Updated 4 years ago
- Python package for deduplication/entity resolution using active learning☆77Updated 7 months ago
- Abydos NLP/IR library for Python☆185Updated 2 years ago
- A Fast Levenshtein Distance Library for Python☆82Updated last month
- Full text search in your Pandas dataframe☆222Updated 3 months ago
- Python wrapper for aspell (C extension and python version)☆81Updated last year
- Parse numbers written in natural language☆110Updated 5 months ago
- Simplest and fastest image and text annotation tool.☆233Updated last week
- Extending Python's process pool to support asyncio functions☆12Updated 3 years ago
- Multi-Langauge Identification☆29Updated 8 months ago
- A Python module to convert natural language numerics into ints and floats.☆226Updated 6 months ago