buda-base / lucene-bo
Lucene analyzer for Tibetan
☆12Updated 2 months ago
Alternatives and similar repositories for lucene-bo:
Users that are interested in lucene-bo are comparing it to the libraries listed below
- Linguistically analyzed Classical Tibetan texts☆26Updated 3 years ago
- 🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python☆60Updated 3 weeks ago
- 🦜 NLP for Tibetan, in Python.☆33Updated last year
- 😎 Curated list of Tibetan NLP projects☆36Updated 4 years ago
- Resources for spell checking Tibetan☆12Updated 4 years ago
- 😎 Curated list of tibetan canon datasets☆16Updated 4 years ago
- Hunspell files for Tibetan☆22Updated 9 years ago
- ☆53Updated last month
- ✒️ དག་བྱེད། Dakje, improving your spelling and readability☆11Updated 2 years ago
- repo for Tibetan corpora☆21Updated last year
- simple CSV database if Tibetan verbs☆22Updated 9 years ago
- Tibetan Language Processing Library☆19Updated 6 years ago
- This repository will soon contain all scripts and links to the annotated corpora of Tibetan.☆12Updated last week
- An OCR application focused on machine-print Tibetan text.☆16Updated 6 years ago
- Tibetan Unicode to Wylie converter. (EWTS-Extended Wylie Transliteration Scheme)☆23Updated last month
- all of tibetan dictionary.ཚོང་ལས་ལས་དོན་དུ་སྤྱོད་མི་ཆོག གལ་སྲིད་འགལ་ན་ཁྲིམས་རྩོད་བྱུང་ངེས།☆14Updated last year
- uncover old chinese textual parallels based on sound☆13Updated 3 months ago
- Sentence aligner☆108Updated 3 years ago
- Tibetan to English Machine Translation☆10Updated 4 years ago
- ☆28Updated 3 months ago
- ☆17Updated 7 years ago
- ☆63Updated 9 months ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆64Updated 2 months ago
- Libraries and command-line tools for metrical analysis of epic Greek hexameter☆27Updated 6 years ago
- TIP-LAS: An open source toolkit for Tibetan word segmentation and part-of-speech tagging☆81Updated 2 years ago
- Translation Memory Open-source Purifier☆33Updated 2 years ago
- Linguistic search for large annotated text corpora, based on Apache Lucene☆108Updated this week
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year