ye-kyaw-thu / khPOS
khPOS (Khmer Part-of-Speech) Corpus for Khmer NLP Research and Developments
☆26Updated last year
Alternatives and similar repositories for khPOS:
Users that are interested in khPOS are comparing it to the libraries listed below
- A large collection of Khmer language resources. Khmer is a language used by Cambodia.☆109Updated 6 months ago
- Word segmentation using Conditional Random Fields (CRF) for Khmer document☆29Updated 4 years ago
- Khmer unicode text data for unsupervised learning language model☆21Updated 4 years ago
- Split Khmer sentence into an array of words.☆17Updated 2 years ago
- ☆14Updated 6 years ago
- Khmer language processing toolkit☆71Updated last year
- A Keras implementation of a deep learning network to simultaneously perform Word Segmentation and Part-of-Speech (POS) Tagging introduced…☆11Updated 2 years ago
- The payment service that you'll never need☆17Updated 3 months ago
- New and modern Khmer keyboard with new re-design layout and local word segmentation☆23Updated 11 months ago
- Separate Khmer words from given sentences.☆12Updated 5 years ago
- Khmer wordlist for line and word breaking☆36Updated 3 years ago
- API Server for querying administrative regions in Cambodia including Provinces, Districts, Communes and Villages☆28Updated 3 years ago
- Super fast conversion between Khmer number to Arabic☆16Updated last year
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆13Updated 8 months ago
- Myanmar and Thai Language Resources☆9Updated 2 years ago
- A code for transliterating (romanizing) Arabic text using the American Library Association - Library of Congress (ALA-LC) standard☆45Updated 2 years ago
- Core libraries by the PRImA Research Lab☆16Updated 8 months ago
- OCR-D post-correction module based on weighted finite-state transducers☆11Updated last year
- This is a repository for PALM students at Royal University of Phnom Penh (2024). The materials for this course are adopted from https://i…☆15Updated 3 months ago
- A collection of random thing from Khmer Coders community member.☆11Updated 3 years ago
- Tesseract4 finetuned traineddata for Central Kurdish/Sorani☆7Updated 4 years ago
- OCR-D wrapper for detectron2 based segmentation models☆16Updated 6 months ago
- Kamus morfologi untuk bahasa Melayu/Indonesia☆16Updated 4 months ago
- Unicode Standard tokenization routines and orthography profile segmentation☆35Updated last month
- Myanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary for speech recognition (ASR) and speech synthesis (TTS).☆54Updated 3 years ago
- Conversions between various OCR formats☆74Updated last year
- ☆12Updated 2 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Updated 5 years ago
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support☆57Updated 3 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆27Updated 3 years ago