lothelanor / actib
This repository will soon contain all scripts and links to the annotated corpora of Tibetan.
☆12Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for actib
- Linguistically analyzed Classical Tibetan texts☆24Updated 3 years ago
- Lucene analyzer for Tibetan☆12Updated 3 weeks ago
- 🦜 NLP for Tibetan, in Python.☆32Updated last year
- 🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python☆58Updated 2 months ago
- 😎 Curated list of Tibetan NLP projects☆33Updated 4 years ago
- ☆51Updated 2 weeks ago
- simple CSV database if Tibetan verbs☆20Updated 9 years ago
- Hunspell files for Tibetan☆20Updated 9 years ago
- ☆22Updated last month
- This is a collection of sentence-level aligned Sanskrit-Tibetan Etexts.☆14Updated 2 years ago
- ✒️ དག་བྱེད། Dakje, improving your spelling and readability☆11Updated 2 years ago
- Data for the quantitative study of (Vedic) Sanskrit☆111Updated 3 weeks ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆70Updated this week
- A neural word aligner based on multilingual BERT☆328Updated 2 years ago
- Pre-trained BERT Models for Ancient and Medieval Greek, and associated code for LaTeCH 2021 paper titled - "A Pilot Study for BERT Langua…☆33Updated 2 years ago
- Snapshots of the GRETIL repository of South Asian (Sanskrit, Pali, etc.) etexts☆9Updated 2 years ago
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆124Updated 3 years ago
- ☆45Updated this week
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆150Updated 5 months ago
- Efficient Low-Memory Aligner☆139Updated 2 months ago
- A multilingual parallel corpus created from translations of the Bible.☆176Updated 2 months ago
- Extension for pie to include taggers with their models and pre/postprocessors☆11Updated 5 months ago
- A tool that locates, downloads, and extracts machine translation corpora☆147Updated 5 months ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆351Updated last year
- Improved Sentence Alignment in Linear Time and Space☆163Updated last year
- The e-texts of the SARIT project☆39Updated 5 months ago
- Machine-Translation-based sentence alignment tool for parallel text☆300Updated 3 years ago
- 😎 Curated list of tibetan canon datasets☆15Updated 4 years ago
- Repository to store Sanskrit koshas and scripts to process them.☆25Updated 8 months ago