sandinmyjoints / fold_to_ascii
A Python port of the Apache Lucene ASCII Folding Filter that converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the ‘Basic Latin’ Unicode block) into ASCII equivalents, if they exist.
☆15 · Updated 5 years ago
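To illustrate what ASCII folding means in practice, here is a minimal sketch that uses only Python's standard library. It is not the fold_to_ascii API and does not reproduce Lucene's mapping table: `approximate_fold` and the small `_EXTRA` table are hypothetical names introduced for this example, and the approach (Unicode NFKD decomposition plus a handful of hand-picked mappings) is only a rough approximation of the real filter. Characters with no ASCII equivalent are left unchanged, matching the "if they exist" wording above.

```python
import unicodedata

# Hypothetical, hand-picked mappings for characters that NFKD does not
# decompose into ASCII. The real Lucene table is far larger.
_EXTRA = {
    "ß": "ss", "Æ": "AE", "æ": "ae",
    "Ø": "O", "ø": "o", "Ł": "L", "ł": "l",
}


def approximate_fold(text: str) -> str:
    """Approximate ASCII folding (illustrative only, not the library's API)."""
    out = []
    for ch in text:
        if ord(ch) < 128:
            # Already ASCII: keep as-is.
            out.append(ch)
            continue
        if ch in _EXTRA:
            out.append(_EXTRA[ch])
            continue
        # Split the character into base characters plus combining marks,
        # then keep only the ASCII parts (this drops accents).
        decomposed = unicodedata.normalize("NFKD", ch)
        ascii_part = "".join(c for c in decomposed if ord(c) < 128)
        # No ASCII equivalent found: leave the original character unchanged.
        out.append(ascii_part if ascii_part else ch)
    return "".join(out)


if __name__ == "__main__":
    print(approximate_fold("café straße Ærøskøbing"))
```

A dedicated port such as fold_to_ascii is preferable when the output must match Lucene's behaviour, since a sketch like this one covers far fewer characters.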
Alternatives and similar repositories for fold_to_ascii
Users who are interested in fold_to_ascii are comparing it to the libraries listed below.
- A trend viewer written in Python/JavaScript ☆21 · Updated 7 months ago
- A simple fuzzy matching set for python strings ☆227 · Updated 10 months ago
- Abydos NLP/IR library for Python ☆186 · Updated 2 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing. ☆152 · Updated 5 months ago
- Python bindings to the Compact Language Detector ☆33 · Updated 5 years ago
- Python package for Google's diff-match-patch native C++ implementation. ☆79 · Updated last year
- A Python implementation of Lunr.js 🌖 ☆197 · Updated 3 months ago
- Python Solr query utility // http://solrq.readthedocs.org/en/latest/ ☆25 · Updated 2 years ago
- Hy-phen-ation made easy ☆211 · Updated 4 months ago
- Validation and data pipelines made easy! ☆12 · Updated 5 years ago
- ☆52 · Updated last year
- Search 'from' and 'to' strings to learn a text cleaning mapping ☆17 · Updated 9 years ago
- Makes it easy to respect rate limits. ☆96 · Updated 8 years ago
- An asynchronous SPARQL client library using aiohttp ☆24 · Updated 9 months ago
- Faster replacement for Python's urlparse module ☆46 · Updated 6 years ago
- Extract, parse and populate templates from strings ☆27 · Updated 6 years ago
- Scripts for preprocessing morfologik data. ☆40 · Updated 7 years ago
- A Cython implementation of the affine gap string distance ☆57 · Updated 2 years ago
- Modularly extensible semantic metadata validator ☆84 · Updated 9 years ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want) ☆66 · Updated 2 years ago
- ISO 20275 ☆10 · Updated last year
- Python binding for gumbo-parser using Cython ☆14 · Updated 8 years ago
- Python package for harvesting records from OAI-PMH provider(s). ☆63 · Updated 2 years ago
- ElasticSearch ODM (Object Document Mapper) for Python - pip install esengine ☆110 · Updated 5 years ago
- Fuzzy Categorical Distances ☆14 · Updated 5 years ago
- Streaming newline delimited JSON I/O. ☆12 · Updated last year
- Sunburnt offspring solr client ☆27 · Updated 3 years ago
- Multi-Language Identification ☆28 · Updated 11 months ago
- Now included in rigour ☆151 · Updated last month
- Super-fast and clean conversions to numbers for Python. ☆109 · Updated 3 months ago