ye-kyaw-thu / myUDTree
Universal Dependency Tree for Myanmar Language
☆10 Updated 10 months ago
Alternatives and similar repositories for myUDTree
Users who are interested in myUDTree are comparing it to the repositories listed below.
- myPOS (Myanmar Part-of-Speech) Corpus for Myanmar NLP Research and Developments ☆78 Updated 2 months ago
- Syllable segmentation tool for Myanmar language (Burmese) by Ye. ☆63 Updated last year
- Some lecture materials of NLP Class at UTYCC ☆31 Updated 2 years ago
- Myanmar Word Segmentation Tool ☆32 Updated 7 years ago
- Python library for Myanmar language ☆38 Updated last year
- preprocessing and postediting tools especially for NLP (bash, perl, python) ☆17 Updated 4 months ago
- Laphet: A tiny neural network language modeling library designed for students and educators. ☆11 Updated 10 months ago
- Myanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary for speech recognition (ASR) and speech synthesis (TTS). ☆55 Updated 4 years ago
- Python library for Myanmar text processing ☆73 Updated 4 months ago
- A library to compose and decompose Hangul syllables using Hangul jamo characters ☆28 Updated 3 years ago
- Benchmark Arabic text diacritization dataset ☆76 Updated 6 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages. ☆112 Updated last year
- A dataset for online Arabic calligraphy. A collection of 2500 annotated calligraphic styles. ☆153 Updated last year
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA). ☆57 Updated 2 years ago
- Aksharamukha Python Library ☆55 Updated 10 months ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages ☆79 Updated 3 years ago
- Code and models for "The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models". EACL 2021, WANLP. ☆54 Updated last year
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa. ☆40 Updated 3 years ago
- ☆115 Updated 2 months ago
- The largest public catalogue for Arabic NLP and speech datasets. There are +500 datasets annotated with more than 25 attributes. ☆189 Updated 2 weeks ago
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling ☆21 Updated last year
- ✍️ Bengali Alphabet (বাংলা বর্ণমালা) ☆86 Updated last year
- A list of awesome Machine Translation frameworks, libraries, software and papers ☆194 Updated last year
- ☆63 Updated 4 years ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models" ☆38 Updated 2 years ago
- Code for extracting parallel corpora from pmindia ☆16 Updated 5 years ago
- Arabic cleaning, normalization and segmentation library. ☆72 Updated 2 years ago
- Morphological dictionary for the Malay/Indonesian language ☆17 Updated last year
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E… ☆28 Updated 2 years ago
- TUFS Asian Language Parallel Corpus ☆51 Updated 2 years ago