ye-kyaw-thu / khPOSLinks
khPOS (Khmer Part-of-Speech) Corpus for Khmer NLP Research and Developments
☆31Updated last year
Alternatives and similar repositories for khPOS
Users that are interested in khPOS are comparing it to the libraries listed below
Sorting:
- A large collection of Khmer language resources. Khmer is a language used by Cambodia.☆145Updated 2 weeks ago
- Word segmentation using Conditional Random Fields (CRF) for Khmer document☆30Updated 5 years ago
- Khmer language processing toolkit☆77Updated last year
- Khmer wordlist for line and word breaking☆38Updated 4 years ago
- ☆14Updated 6 years ago
- Khmer unicode text data for unsupervised learning language model☆25Updated 4 years ago
- A code for transliterating (romanizing) Arabic text using the American Library Association - Library of Congress (ALA-LC) standard☆48Updated 3 years ago
- New and modern Khmer keyboard with new re-design layout and local word segmentation☆25Updated last year
- Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Ho…☆21Updated 3 years ago
- Morphological processing for languages of the Horn of Africa☆46Updated last week
- Pronounce Arabic words☆19Updated 6 years ago
- Core libraries by the PRImA Research Lab☆16Updated last year
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆13Updated last year
- Recognize text using Calamari OCR and the OCR-D framework☆15Updated 3 months ago
- This is a repository for PALM students at Royal University of Phnom Penh (2024). The materials for this course are adopted from https://i…☆16Updated 4 months ago
- This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat…☆26Updated 6 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆30Updated 2 months ago
- Unicode Standard tokenization routines and orthography profile segmentation☆37Updated 6 months ago
- Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking.☆52Updated last year
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆82Updated 3 months ago
- Benchmark Arabic text diacritization dataset☆75Updated 6 years ago
- Kamus morfologi untuk bahasa Melayu/Indonesia☆17Updated 9 months ago
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars☆17Updated last year
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Updated last year
- The Kurdish Language Processing Toolkit☆104Updated 3 weeks ago
- Arabic data☆14Updated last month
- Master repository which includes most other OCR-D repositories as submodules☆73Updated last month
- ☆12Updated 5 years ago
- The Unicode Cookbook for Linguists☆56Updated 4 years ago
- Vietnamese Text Dataset - Wikipedia vi 2018☆14Updated 6 years ago