Esukhia / Corpora
repo for Tibetan corpora
☆21Updated last year
Alternatives and similar repositories for Corpora:
Users that are interested in Corpora are comparing it to the libraries listed below
- ✒️ དག་བྱེད། Dakje, improving your spelling and readability☆11Updated 2 years ago
- ☆17Updated 7 years ago
- 🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python☆62Updated 2 weeks ago
- 🦜 NLP for Tibetan, in Python.☆34Updated last year
- 😎 Curated list of Tibetan NLP projects☆37Updated 4 years ago
- Linguistically analyzed Classical Tibetan texts☆26Updated 3 years ago
- TIP-LAS: An open source toolkit for Tibetan word segmentation and part-of-speech tagging☆80Updated 2 years ago
- ☆42Updated 6 years ago
- <u><a href="https://circse.github.io/LT4HALA/" style="color: white">Workshop on Language Technologies for Historical and Ancient Language…☆33Updated 9 months ago
- Dataset for TALLIP2019 paper "Ancient-Modern Chinese Translation with a New Large Training Dataset"☆23Updated 2 years ago
- Efficient Low-Memory Aligner☆142Updated 2 months ago
- Tibetan Language Processing Library☆21Updated 7 years ago
- Prior Knowledge Integration for Neural Machine Translation using Posterior Regularization☆11Updated 6 years ago
- Hunspell files for Tibetan☆22Updated 9 years ago
- ☆28Updated 4 months ago
- Efficient Markov Chain word alignment☆53Updated 3 years ago
- all of tibetan dictionary.ཚོང་ལས་ལས་དོན་དུ་སྤྱོད་མི་ཆོག གལ་སྲིད་འགལ་ན་ཁྲིམས་རྩོད་བྱུང་ངེས།☆14Updated last year
- Baseline models, training scripts, and instructions on how to reproduce our results for our state-of-art grammar correction system from M…☆72Updated 5 years ago
- ☆14Updated 9 years ago
- ☆55Updated last year
- Summary of Responses to Questionnaire on Annotation Platform https://forms.gle/iZk8kehkjAWmB8xe9☆59Updated 4 years ago
- A PyTorch implementation of "Reaching Human-level Performance in Automatic Grammatical Error Correction: An Empirical Study"☆50Updated 6 years ago
- The code for EMNLP2022 paper "Improved grammatical error correction by ranking elementary edits"☆19Updated 2 years ago
- Examples, tutorials and use cases for Marian, including our WMT-2017/18 baselines.☆76Updated last year
- We use phonetics as a feature to create a joint semantic-phonetic embedding and improve the neural machine translation between Chinese an…☆11Updated 3 years ago
- ☆36Updated 2 years ago
- JFLEG (JHU FLuency-Extended GUG) corpus for Grammatical Error Correction Evaluation☆113Updated last year
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆66Updated 4 months ago
- An open-source classical Chinese information processing toolkit developed by Tsinghua Natural Language Processing Group☆51Updated 6 years ago
- Framework for neural-based Quality Estimation☆42Updated 4 years ago