tibetan-nlp / tibetan-collation
Collation algorithm for Tibetan
☆10Updated 9 years ago
Alternatives and similar repositories for tibetan-collation
Users that are interested in tibetan-collation are comparing it to the libraries listed below
Sorting:
- simple CSV database if Tibetan verbs☆22Updated 9 years ago
- Linguistically analyzed Classical Tibetan texts☆26Updated 3 years ago
- Hunspell files for Tibetan☆22Updated 9 years ago
- Resources for spell checking Tibetan☆13Updated 4 years ago
- ☆10Updated 2 years ago
- 🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python☆67Updated 2 months ago
- Lucene analyzer for Tibetan☆12Updated this week
- 😎 Curated list of Tibetan NLP projects☆37Updated 4 years ago
- Tibetan Unicode to Wylie converter. (EWTS-Extended Wylie Transliteration Scheme)☆25Updated this week
- 😎 Curated list of tibetan canon datasets☆17Updated 5 years ago
- ☆56Updated 4 months ago
- The e-texts of the SARIT project☆40Updated 11 months ago
- An OCR application focused on machine-print Tibetan text.☆16Updated 6 years ago
- Chinese Notes: A digital library for classical and historic Chinese texts with built in dictionary and reader☆23Updated 3 weeks ago
- Multilingual sentence alignment using sentence embeddings☆117Updated 6 months ago
- Python API to access glottolog/glottolog☆29Updated 6 months ago
- Sentence aligner☆112Updated 3 years ago
- LingPy: Python library for quantitative tasks in historical linguistics☆133Updated 2 months ago
- Hanzipy is a Chinese character and NLP module for Chinese language processing for python. It is primarily written to help provide a frame…☆21Updated last year
- Sources of Collatinus software - Latin lemmatizer, morphological analyzer and scansion☆75Updated 3 weeks ago
- Collection of hand-analyzed ancient Greek prose in dependency trees.☆18Updated 2 years ago
- Lexical data at Unicode☆68Updated 8 months ago
- CLDF: Cross-Linguistic Data Formats - the specification☆57Updated last year
- Snapshots of the GRETIL repository of South Asian (Sanskrit, Pali, etc.) etexts☆9Updated 3 years ago
- Ideographic Description Sequences☆26Updated last month
- SIGTYP 2022 Shared Task☆9Updated 2 years ago
- uncover old chinese textual parallels based on sound☆13Updated 6 months ago
- Shobhika is a Devanāgarī font for scholars.☆37Updated 6 years ago
- Machine-Translation-based sentence alignment tool for parallel text☆309Updated 4 years ago
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆256Updated 9 months ago