pysuxing / python-stardictLinks
python module reading the StarDict dictionaries
☆44Updated 2 years ago
Alternatives and similar repositories for python-stardict
Users that are interested in python-stardict are comparing it to the libraries listed below
Sorting:
- Library for manipulating StarDict dictionaries from within Python☆106Updated 3 months ago
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆94Updated this week
- A toolbox for working with the Chinese language in Python☆149Updated 6 years ago
- an open solution for collecting n-gram Chinese lexicon and n-gram statistics☆73Updated 9 years ago
- Python bindings for libwapiti☆67Updated 6 years ago
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm☆245Updated 13 years ago
- OpenCC binding for Python.☆52Updated 5 years ago
- Thot toolkit for statistical machine translation☆53Updated 3 years ago
- A simple python script to translate chinese to pinyin based on Mandarin.dat☆217Updated last year
- Extract data from Octopus mdict (*.mdd, *.mdx) files☆24Updated 8 years ago
- Han character library for CJKV languages☆165Updated 4 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆200Updated 5 years ago
- A more complete example of programming with PDFMiner, which continues where the default documentation stops☆216Updated 6 years ago
- Chinese Tokenizer module for Python☆16Updated 7 years ago
- Python wrapper for LanguageTool grammar checker☆329Updated 4 years ago
- Python module that identifies Chinese text as being Simplified or Traditional☆105Updated last year
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆378Updated 3 years ago
- *Deprecated* A fast and accurate part-of-speech tagger for TextBlob.☆101Updated 10 years ago
- Import of https://sourceforge.net/projects/champollion☆18Updated 9 years ago
- Imported from mdict analysis☆264Updated 4 years ago
- Offline bilingual dictionaries made using data from Wiktionary☆62Updated 10 years ago
- A series of scripts to download and parse the OpenSubtitles corpus.☆85Updated 3 months ago
- Text normalization library for Python☆202Updated 7 years ago
- Machine-Translation-based sentence alignment tool for parallel text☆314Updated 4 years ago
- Context sensitive spell checker for Icelandic based on a recurrent neural network model from karpathy/char-rnn. This repo is no longer in…☆40Updated 10 years ago
- A toolkit for corpus linguistics☆206Updated 6 years ago
- Chinese morphological analysis with Word Segment and POS Tagging data for MeCab☆162Updated 8 years ago
- A parser and autocorrection tool for wiktionary.☆39Updated 10 years ago
- ☆98Updated 4 years ago
- Transition-based statistical parser☆417Updated 8 years ago