ye-kyaw-thu / myUDTreeLinks
Universal Dependency Tree for Myanmar Language
☆10Updated 5 months ago
Alternatives and similar repositories for myUDTree
Users that are interested in myUDTree are comparing it to the libraries listed below
Sorting:
- myPOS (Myanmar Part-of-Speech) Corpus for Myanmar NLP Research and Developments☆74Updated 7 months ago
- Some lecture materials of NLP Class at UTYCC☆30Updated 2 years ago
- Myanmar Word Segmentation Tool☆30Updated 6 years ago
- syllable, word and phrase segmenter for Burmese (Myanmar language)☆58Updated 3 years ago
- preprocessing and postediting tools especially for NLP (bash, perl, python)☆17Updated 7 months ago
- Laphet: A tiny neural network language modeling library designed for students and educators.☆11Updated 5 months ago
- Syllable segmentation tool for Myanmar language (Burmese) by Ye.☆58Updated last year
- Python library for Myanmar language☆36Updated last year
- Python library for Myanmar text processing☆70Updated 4 years ago
- ☆139Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆74Updated 3 years ago
- ☆109Updated last year
- Repository for contributions for Data Generation for Post-OCR correction of Cyrillic handwriting paper☆20Updated last year
- Cross-lingual learning in scene text recognition (ICASSP2024)☆16Updated 9 months ago
- Aksharamukha Python Library☆50Updated 5 months ago
- A dataset for online Arabic calligraphy. A collection of 2500 annotated calligraphic styles.☆152Updated last year
- Kamus morfologi untuk bahasa Melayu/Indonesia☆16Updated 7 months ago
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).☆53Updated 2 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆106Updated last year
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆20Updated 11 months ago
- Multilingual Speech Recognition for Indonesian Languages☆64Updated 2 years ago
- Classical Arabic Named Entity Recognition Corpus☆19Updated 2 years ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆40Updated 2 years ago
- ☆59Updated 3 years ago
- မြန်မာစကားလုံးများ (Myanmar Words / Burmese Words).☆46Updated 2 years ago
- Arabic cleaning, normalization and segmentation library.☆70Updated last year
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆37Updated last year
- This dataset contains 207,572 books from the Amazon.com, Inc. marketplace.☆254Updated 4 years ago
- A python library for extracting text from PDFs without losing the formatting of the PDF content.☆77Updated 3 years ago
- TUFS Asian Language Parallel Corpus☆50Updated 2 years ago