zheplusplus / pyunicodeblockLinks
Python Unicode Block Utilities
☆24Updated 5 years ago
Alternatives and similar repositories for pyunicodeblock
Users that are interested in pyunicodeblock are comparing it to the libraries listed below
Sorting:
- unihandecode is a transliteration library to convert all characters/words in Unicode into ASCII alphabet that aware with Language prefere…☆68Updated 3 years ago
- Python package for lexicon; Trie and DAWG implementation.☆55Updated 9 months ago
- The PyICU project repository has moved to https://pyicu.org.☆133Updated 4 years ago
- Cython wrapper on Hunspell Dictionary☆66Updated last year
- Python difflib with parts reimplemented in C☆40Updated 8 months ago
- Accurately find/replace/remove emojis in text strings☆162Updated last year
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆53Updated 4 years ago
- Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt).☆53Updated last year
- A python package for grapheme aware string handling☆115Updated 3 years ago
- Hy-phen-ation made easy☆212Updated 7 months ago
- A Python binding of SQLite Full Text Search Tokenizer☆48Updated 2 weeks ago
- A Python library to parse MediaWiki WikiText☆313Updated 4 months ago
- bin files☆13Updated 7 months ago
- 💥 Cython hash tables that assume keys are pre-hashed☆85Updated 4 months ago
- ISO 639 library for Python☆34Updated last year
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)☆67Updated 2 years ago
- Abydos NLP/IR library for Python☆190Updated 2 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆75Updated 3 weeks ago
- An index data structure for approximate string search.☆23Updated 6 years ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆64Updated last week
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library.☆303Updated last year
- Lightning Fast Language Prediction 🚀☆167Updated last month
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆175Updated 3 months ago
- A python module for word inflections designed for use with spaCy.☆93Updated 5 years ago
- A sentence segmentation library with wide language support optimized for speed and utility.☆67Updated 3 months ago
- python3 package supporting efficient storage and querying of sets of sets using the trie data structure. Supports finding all the superse…☆23Updated 2 years ago
- Python search module for fast approximate string matching☆54Updated 2 years ago
- A Python library for working with and comparing language codes.☆350Updated 4 months ago
- Levenshtein and Hamming distance computation☆116Updated 5 years ago
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity☆74Updated last year