Intsights / PyWordSegment
Concatenated-word segmentation Python library written in Rust
☆16Updated 9 months ago
Alternatives and similar repositories for PyWordSegment:
Users that are interested in PyWordSegment are comparing it to the libraries listed below
- Intsights open-source wrappers library for some AWS resources and high level management objects for distributed backend systems☆17Updated last year
- Python library for fast substring/pattern search written in C++ leveraging Suffix Array Algorithm☆41Updated 9 months ago
- Python library for a duplicate lines removal written in C++☆33Updated 9 months ago
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices☆14Updated 5 months ago
- A blazingly fast domain extraction library written in Rust☆65Updated 5 months ago
- Python library for fast fuzzy search over a big file written in Rust☆45Updated 9 months ago
- A Git Repository Secrets Scanner written in Rust☆39Updated 9 months ago
- A fast and easy adblockplus parser and matcher based on adblock-rust package☆27Updated 9 months ago
- Queue server base on RocksDB as a KV-Store backend and gRPC as an interface☆10Updated last year
- Word frequency checker based on Wikipedia corpus written in Rust☆10Updated 9 months ago
- A Python based alternative to Elasticsearch Reindex API with multiprocessing support.☆17Updated 9 months ago
- Parse numbers written in natural language☆109Updated 2 months ago
- Fast Levenshtein Distance Library for Python 3☆82Updated 2 years ago
- ☆68Updated 2 years ago
- ConfZ is a configuration management library for Python based on pydantic.☆228Updated last week
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 3 years ago
- This collection of general purpose python magic was too good to keep for ourselves!☆15Updated 5 months ago
- Multi-Langauge Identification☆29Updated 5 months ago
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆60Updated this week
- A mongo mocking library with an ephemeral MongoDB running in memory.☆41Updated 4 months ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆121Updated 2 weeks ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆445Updated this week
- Language detection using Spacy and Fasttext☆54Updated last year
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- An Elasticsearch Python ORM based on Pydantic.☆124Updated last year
- Extending Python's process pool to support asyncio functions☆12Updated 3 years ago
- 🔤 Measure edit distance based on keyboard layout☆58Updated last year
- 🐍 A CPython extension for the Hyperscan regular expression matching library.☆167Updated last week
- ☆167Updated 7 months ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆64Updated 2 years ago