pysuxing / python-stardict
Python module for reading StarDict dictionaries
☆44Updated 2 years ago
Alternatives and similar repositories for python-stardict
Users that are interested in python-stardict are comparing it to the libraries listed below
- Library for manipulating StarDict dictionaries from within Python☆106Updated 3 months ago
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆94Updated this week
- an open solution for collecting n-gram Chinese lexicon and n-gram statistics☆73Updated 9 years ago
- A toolbox for working with the Chinese language in Python☆149Updated 5 years ago
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm☆245Updated 13 years ago
- Thot toolkit for statistical machine translation☆53Updated 3 years ago
- Python bindings for libwapiti☆67Updated 6 years ago
- OpenCC binding for Python.☆52Updated 5 years ago
- A toolkit for corpus linguistics☆206Updated 6 years ago
- Offline bilingual dictionaries made using data from Wiktionary☆62Updated 10 years ago
- Machine-Translation-based sentence alignment tool for parallel text☆313Updated 4 years ago
- A more complete example of programming with PDFMiner, which continues where the default documentation stops☆216Updated 6 years ago
- Han character library for CJKV languages☆165Updated 4 years ago
- Python wrapper for LanguageTool grammar checker☆329Updated 4 years ago
- Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt).☆53Updated last year
- Data collection, alignment and TAUS repository☆23Updated 8 years ago
- python based software to unpack kindlegen generated ebooks☆74Updated 4 months ago
- parallel corpora for any languages supported by glosbe.com☆10Updated 9 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆200Updated 5 years ago
- Wrapper for pdftohtml that tries to extract paragraph structure☆52Updated 7 years ago
- Chinese morphological analysis with Word Segment and POS Tagging data for MeCab☆162Updated 8 years ago
- A simple Vietnamese word segmentation program☆20Updated 10 years ago
- *Deprecated* A fast and accurate part-of-speech tagger for TextBlob.☆101Updated 10 years ago
- clone of https://code.google.com/p/splitta/ so it can be a git submodule☆34Updated 12 years ago
- Distributed text analysis suite based on Celery☆96Updated 3 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆74Updated 10 years ago
- ISO 639 library for Python☆35Updated last year
- Fast multi-keyword search engine for text strings☆258Updated last year
- Hwyluso cyfieithu peirianyddol MosesSMT i'r Gymraeg // Making MosesSMT machine translation easier for Welsh (and other languages)☆16Updated 4 years ago
- TED parallel Corpora is a growing collection of bilingual parallel corpora, multilingual parallel corpora, and monolingual corpora extracted…☆253Updated 10 years ago