pysuxing / python-stardict
python module reading the StarDict dictionaries
☆44Updated last year
Alternatives and similar repositories for python-stardict:
Users that are interested in python-stardict are comparing it to the libraries listed below
- Library for manipulating StarDict dictionaries from within Python☆104Updated last year
- OpenCC binding for Python.☆52Updated 4 years ago
- A toolbox for working with the Chinese language in Python☆148Updated 4 years ago
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆88Updated last week
- Machine-Translation-based sentence alignment tool for parallel text☆304Updated 3 years ago
- Offline bilingual dictionaries made using data from Wiktionary☆52Updated 9 years ago
- Thot toolkit for statistical machine translation☆50Updated 2 years ago
- A parser and autocorrection tool for wiktionary.☆39Updated 9 years ago
- Python module that identifies Chinese text as being Simplified or Traditional☆86Updated last month
- Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt).☆52Updated 9 months ago
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm☆243Updated 12 years ago
- Chinese morphological analysis with Word Segment and POS Tagging data for MeCab☆158Updated 7 years ago
- Han character library for CJKV languages☆153Updated 3 years ago
- Automatically exported from code.google.com/p/stardict-3☆310Updated 2 years ago
- Chinese Character Frequencies☆20Updated 7 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary☆11Updated last year
- Sentence aligner☆109Updated 3 years ago
- Memory-based shallow parser for Python☆73Updated 5 years ago
- The Arborator software is aimed at collaboratively annotating dependency corpora.☆25Updated 5 years ago
- Import of https://sourceforge.net/projects/champollion☆18Updated 8 years ago
- convert epub file to txt☆86Updated 4 years ago
- python based software to unpack kindlegen generated ebooks☆61Updated last year
- Hwyluso cyfieithu peirianyddol MosesSMT i'r Gymraeg // Making MosesSMT machine translation easier for Welsh (and other languages)☆16Updated 3 years ago
- A python module for looking up mdict dictionary file (.mdx and .mdd).☆310Updated 4 months ago
- words frequency top100k from BNC/ANC/COCA, dsl format, for goldendict☆62Updated 8 years ago
- A more complete example of programming with PDFMiner, which continues where the default documentation stops☆214Updated 5 years ago
- Non-Overlapping Aho-Corasick Python extension, for Python 2 (str and unicode) and Python 3☆51Updated 9 years ago
- TED parallel Corpora is growing collection of Bilingual parallel corpora, Multilingual parallel corpora and Monolingual corpora extracted…☆246Updated 9 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆72Updated 9 years ago