buda-base / lucene-bo
Lucene analyzer for Tibetan
β12Updated last month
Alternatives and similar repositories for lucene-bo:
Users that are interested in lucene-bo are comparing it to the libraries listed below
- Linguistically analyzed Classical Tibetan textsβ25Updated 3 years ago
- π Curated list of Tibetan NLP projectsβ36Updated 4 years ago
- π· ΰ½ΰ½Όΰ½ΰΌΰ½ΰ½Όΰ½ [pΚ°ΓΈtΙkΜ] Tibetan word tokenizer in Pythonβ58Updated last month
- Resources for spell checking Tibetanβ12Updated 4 years ago
- π¦ NLP for Tibetan, in Python.β33Updated last year
- Hunspell files for Tibetanβ22Updated 9 years ago
- π Curated list of tibetan canon datasetsβ16Updated 4 years ago
- repo for Tibetan corporaβ21Updated last year
- β52Updated 3 weeks ago
- βοΈ ΰ½ΰ½ΰΌΰ½ΰΎ±ΰ½Ίΰ½ΰΌ Dakje, improving your spelling and readabilityβ11Updated 2 years ago
- Tibetan Language Processing Libraryβ18Updated 6 years ago
- An OCR application focused on machine-print Tibetan text.β16Updated 6 years ago
- simple CSV database if Tibetan verbsβ22Updated 9 years ago
- Tibetan phonetics engine in Pythonβ16Updated 2 months ago
- Sentence alignerβ109Updated 3 years ago
- Tibetan Unicode to Wylie converter. (EWTS-Extended Wylie Transliteration Scheme)β22Updated 2 weeks ago
- β63Updated 8 months ago
- Software for phonetic transcription of English and Finnish, and IPA toolsβ15Updated 9 years ago
- β17Updated 7 years ago
- Translation Memory Open-source Purifierβ33Updated 2 years ago
- This repository will soon contain all scripts and links to the annotated corpora of Tibetan.β12Updated 2 months ago
- β28Updated 2 months ago
- Tibetan to English Machine Translationβ10Updated 4 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.β34Updated last year
- Annodoc annotation documentation support systemβ35Updated 4 years ago
- TIP-LAS: An open source toolkit for Tibetan word segmentation and part-of-speech taggingβ81Updated 2 years ago
- Python version for Doug Biber's Multidimensional Analysis (MDA)β28Updated last month
- Efficient Low-Memory Alignerβ140Updated this week
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammarsβ15Updated 7 months ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Spanβ¦β73Updated last month