ye-kyaw-thu / khPOSLinks
khPOS (Khmer Part-of-Speech) Corpus for Khmer NLP Research and Developments
☆29Updated last year
Alternatives and similar repositories for khPOS
Users that are interested in khPOS are comparing it to the libraries listed below
Sorting:
- A large collection of Khmer language resources. Khmer is a language used by Cambodia.☆135Updated last month
- Khmer language processing toolkit☆72Updated last year
- Word segmentation using Conditional Random Fields (CRF) for Khmer document☆30Updated 5 years ago
- Khmer unicode text data for unsupervised learning language model☆23Updated 4 years ago
- Khmer wordlist for line and word breaking☆36Updated 3 years ago
- New and modern Khmer keyboard with new re-design layout and local word segmentation☆22Updated last year
- ☆14Updated 6 years ago
- This is a repository for PALM students at Royal University of Phnom Penh (2024). The materials for this course are adopted from https://i…☆16Updated 2 months ago
- Split Khmer sentence into an array of words.☆20Updated 2 years ago
- Just Another Word Segmenter (JAWS): A Graph Neural Network Model for Khmer Word Segmentation☆11Updated 8 months ago
- Separate Khmer words from given sentences.☆13Updated 5 years ago
- A Keras implementation of a deep learning network to simultaneously perform Word Segmentation and Part-of-Speech (POS) Tagging introduced…☆11Updated 3 years ago
- A collection of random thing from Khmer Coders community member.☆15Updated 4 years ago
- A standardized benchmark dataset for Khmer Optical Character Recognition (OCR) engine.☆24Updated 2 years ago
- OCR-D python tools☆33Updated 10 months ago
- Double-checked Gold Standard Data for Training and Testing OCR Engines☆14Updated 8 years ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆13Updated 10 months ago
- Master repository which includes most other OCR-D repositories as submodules☆73Updated this week
- An Enhanced Version of Piper especially for Vietnamese :)☆19Updated 2 months ago
- Conversions between various OCR formats☆78Updated 2 years ago
- ☆13Updated 2 years ago
- ⚡Extracting the Machine Readable Zone (MRZ) from passport or any document images☆69Updated 3 months ago
- convert PubLayNet data into METS/PAGE-XML☆10Updated 5 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Updated 11 months ago
- A Vietnamese phonetizer☆51Updated last year
- Tool for enhancing noisy scanned text images☆48Updated 5 years ago
- A code for transliterating (romanizing) Arabic text using the American Library Association - Library of Congress (ALA-LC) standard☆47Updated 3 years ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated last month
- OCR-D post-correction module based on weighted finite-state transducers☆11Updated last year
- Recognize text using Calamari OCR and the OCR-D framework☆14Updated last month