sandinmyjoints / fold_to_ascii
A Python port of the Apache Lucene ASCII Folding Filter that converts alphabetic, numeric, and symbolic Unicode characters which are not among the first 128 Unicode code points (the ‘Basic Latin’ Unicode block, i.e. ASCII) into ASCII equivalents, if they exist.
☆15 Updated 4 years ago
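To illustrate what "folding to ASCII" means, here is a minimal sketch using only Python's standard `unicodedata` module. It is not this package's API or implementation: the Lucene filter relies on a large hand-written mapping table, whereas this sketch approximates the idea with NFKD decomposition. The function name `fold_approx` and the `EXTRA_FOLDS` table are hypothetical, and the results may differ from the real filter for many characters.

```python
import unicodedata

# Hypothetical, hand-picked fallbacks for characters that have no Unicode
# decomposition. The real Lucene table is far larger; this is only a sample.
EXTRA_FOLDS = {
    "ß": "ss", "Æ": "AE", "æ": "ae",
    "Ø": "O", "ø": "o", "Ł": "L", "ł": "l",
}


def fold_approx(text: str) -> str:
    """Approximate ASCII folding: replace non-ASCII characters with an
    ASCII equivalent when one can be derived, otherwise keep them."""
    out = []
    for ch in text:
        if ord(ch) < 128:
            # Already in the Basic Latin block: keep unchanged.
            out.append(ch)
            continue
        if ch in EXTRA_FOLDS:
            out.append(EXTRA_FOLDS[ch])
            continue
        # Decompose (e.g. "é" -> "e" + combining acute) and drop the marks.
        decomposed = unicodedata.normalize("NFKD", ch)
        stripped = "".join(
            c for c in decomposed if not unicodedata.combining(c)
        )
        # Keep the original character if no pure-ASCII result was found.
        out.append(stripped if stripped and stripped.isascii() else ch)
    return "".join(out)


print(fold_approx("Café naïve Straße ①"))
```

Characters with no ASCII equivalent (for example CJK ideographs) pass through unchanged, which mirrors the "if they exist" qualifier in the description above.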
Alternatives and similar repositories for fold_to_ascii:
Users who are interested in fold_to_ascii are comparing it to the libraries listed below.
- Python package for Google's diff-match-patch native C++ implementation. ☆74 Updated 8 months ago
- Python port for IWNLP.Lemmatizer ☆17 Updated last year
- Elasticsearch proxy for Quepid. ☆13 Updated this week
- A Python implementation of Lunr.js 🌖 ☆196 Updated last month
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe… ☆170 Updated 3 years ago
- Python bindings to the Compact Language Detector ☆33 Updated 4 years ago
- Python Solr query utility // http://solrq.readthedocs.org/en/latest/ ☆25 Updated 2 years ago
- A trend viewer written in Python/JavaScript ☆21 Updated 3 months ago
- An index data structure for approximate string search. ☆23 Updated 5 years ago
- Hidden alignment conditional random field for classifying string pairs. ☆36 Updated 7 years ago
- Python binding for gumbo-parser using Cython ☆14 Updated 8 years ago
- A fast and simple JavaScript library specifically targeted at collecting search and search related browser events. ☆40 Updated 5 months ago
- Hy-phen-ation made easy ☆207 Updated 3 weeks ago
- Fast Python Bloom Filter using Mmap ☆129 Updated 8 months ago
- A simple fuzzy matching set for python strings ☆225 Updated 6 months ago
- Extract, parse and populate templates from strings ☆27 Updated 5 years ago
- Regular Expression based parsers for extracting data from natural languages ☆70 Updated 7 years ago
- Sort-friendly URI Reordering Transform (SURT) python module ☆41 Updated 6 months ago
- Abydos NLP/IR library for Python ☆184 Updated 2 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing. ☆147 Updated last month
- PyPruningRadixTrie - Python version of super fast Radix trie for prefix search & auto-complete ☆39 Updated 2 months ago
- Locality-sensitive hashing algorithm for text similarity comparisons ☆59 Updated 3 years ago
- extract difference between two html pages ☆32 Updated 6 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts ☆81 Updated 6 years ago
- Language detection extension for spaCy 2.0+ ☆112 Updated 6 years ago
- 💥 Cython hash tables that assume keys are pre-hashed ☆85 Updated 3 weeks ago
- A python library to generate highly realistic typos (fuzz-testing) ☆11 Updated 6 years ago
- A time machine for debugging pesky stateful errors. ☆35 Updated 8 years ago
- Search relevance evaluation toolkit ☆31 Updated 2 years ago
- Price and currency parsing utility ☆26 Updated last year