pysuxing / python-stardict
python module reading the StarDict dictionaries
☆45Updated last year
Alternatives and similar repositories for python-stardict
Users that are interested in python-stardict are comparing it to the libraries listed below
Sorting:
- Library for manipulating StarDict dictionaries from within Python☆104Updated last year
- an open solution for collecting n-gram Chinese lexicon and n-gram statistics☆74Updated 9 years ago
- OpenCC binding for Python.☆52Updated 5 years ago
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm☆245Updated 12 years ago
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆91Updated this week
- A toolbox for working with the Chinese language in Python☆150Updated 5 years ago
- Machine-Translation-based sentence alignment tool for parallel text☆309Updated 4 years ago
- Offline bilingual dictionaries made using data from Wiktionary☆54Updated 10 years ago
- Python bindings for libwapiti☆67Updated 5 years ago
- Thot toolkit for statistical machine translation☆53Updated 2 years ago
- Automatically exported from code.google.com/p/stardict-3☆319Updated 3 years ago
- convert epub file to txt☆88Updated 5 years ago
- Convert *.LD2 dictionaries format into human-readable text files☆66Updated 12 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆73Updated 10 years ago
- Converting Chinese number string <=> int/float/str☆19Updated 2 weeks ago
- TED parallel Corpora is growing collection of Bilingual parallel corpora, Multilingual parallel corpora and Monolingual corpora extracted…☆250Updated 9 years ago
- Import of https://sourceforge.net/projects/champollion☆18Updated 9 years ago
- A simple Vietnamese word segmentation program☆20Updated 9 years ago
- Han character library for CJKV languages☆158Updated 4 years ago
- Imported from mdict analysis☆255Updated 3 years ago
- Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt).☆52Updated last year
- *Deprecated* A fast and accurate part-of-speech tagger for TextBlob.☆102Updated 9 years ago
- The PyICU project repository has moved to https://pyicu.org.☆133Updated 4 years ago
- Chinese Wordnet v.2☆22Updated 8 years ago
- A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora.☆33Updated 6 years ago
- TheanoLM is a recurrent neural network language modeling tool implemented using Theano☆81Updated 10 months ago
- A toolkit for corpus linguistics☆205Updated 5 years ago
- Pali Buddhist scriptures of 15 countries and its parallel corpus☆9Updated 6 years ago
- Convert Sino-Korean words written in Hangul to Chinese characters, which is called hanja in Korean, using neural networks☆30Updated 7 years ago
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine☆186Updated 4 years ago