tibetan-nlp / classical-tibetan-corpusLinks
Linguistically analyzed Classical Tibetan texts
β26Updated 4 years ago
Alternatives and similar repositories for classical-tibetan-corpus
Users that are interested in classical-tibetan-corpus are comparing it to the libraries listed below
Sorting:
- π Curated list of Tibetan NLP projectsβ41Updated 5 years ago
- π· ΰ½ΰ½Όΰ½ΰΌΰ½ΰ½Όΰ½ [pΚ°ΓΈtΙkΜ] Tibetan word tokenizer in Pythonβ71Updated last month
- π¦ NLP for Tibetan, in Python.β37Updated 2 years ago
- Sentence alignerβ120Updated 4 years ago
- Improved Sentence Alignment in Linear Time and Spaceβ185Updated 2 years ago
- OpusFilter - Parallel corpus processing toolkitβ112Updated last week
- Efficient Low-Memory Alignerβ146Updated 10 months ago
- Machine-Translation-based sentence alignment tool for parallel textβ313Updated 4 years ago
- A multilingual parallel corpus created from translations of the Bible.β190Updated 6 months ago
- repo for Tibetan corporaβ21Updated 2 years ago
- Multilingual sentence alignment using sentence embeddingsβ130Updated last year
- Hunspell files for Tibetanβ22Updated 10 years ago
- β65Updated 3 months ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.β160Updated last year
- β78Updated 3 months ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)β67Updated last week
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)β51Updated 2 years ago
- Improving Low-Resource Neural Machine Translation of Related Languages by Transfer Learningβ19Updated 3 years ago
- β32Updated 2 years ago
- A tool that locates, downloads, and extracts machine translation corporaβ159Updated 2 months ago
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressionsβ28Updated 5 years ago
- LingPy: Python library for quantitative tasks in historical linguisticsβ138Updated 4 months ago
- Bitextor generates translation memories from multilingual websitesβ296Updated last year
- A neural word aligner based on multilingual BERTβ359Updated 3 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)β381Updated 2 years ago
- Scripts to preprocess training and test data and to run fast_align and gizaβ107Updated 4 years ago
- Repository for the Georgetown University Multilayer Corpus (GUM)β102Updated last week
- Easier Automatic Sentence Simplification Evaluationβ162Updated 2 years ago
- β18Updated 8 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentationβ28Updated 2 years ago