ye-kyaw-thu / khPOSLinks
khPOS (Khmer Part-of-Speech) Corpus for Khmer NLP Research and Developments
☆31Updated last year
Alternatives and similar repositories for khPOS
Users that are interested in khPOS are comparing it to the libraries listed below
Sorting:
- A large collection of Khmer language resources. Khmer is a language used by Cambodia.☆146Updated 2 weeks ago
- Khmer language processing toolkit☆78Updated last year
- Word segmentation using Conditional Random Fields (CRF) for Khmer document☆30Updated 5 years ago
- Khmer unicode text data for unsupervised learning language model☆25Updated 4 years ago
- Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Ho…☆21Updated 3 years ago
- Recognize text using Calamari OCR and the OCR-D framework☆15Updated 4 months ago
- Core libraries by the PRImA Research Lab☆16Updated last year
- OCR-D wrapper for detectron2 based segmentation models☆17Updated 4 months ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆13Updated last year
- This is a repository for PALM students at Royal University of Phnom Penh (2024). The materials for this course are adopted from https://i…☆16Updated 5 months ago
- ☆13Updated 3 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Updated last year
- Ground Truth Resources for the HTR of patrimonial documents☆44Updated last week
- ☆14Updated 6 years ago
- Master repository which includes most other OCR-D repositories as submodules☆73Updated 2 months ago
- A code for transliterating (romanizing) Arabic text using the American Library Association - Library of Congress (ALA-LC) standard☆49Updated 3 years ago
- Conversions between various OCR formats☆80Updated 2 years ago
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support☆58Updated 4 years ago
- OCR-D python tools☆33Updated last year
- OCRopus model for Gothic print (Fraktur)☆18Updated 5 years ago
- OCR-D post-correction module based on weighted finite-state transducers☆11Updated last year
- Some bits of javascript to transcribe scanned pages using PageXML☆17Updated last year
- A Keras implementation of a deep learning network to simultaneously perform Word Segmentation and Part-of-Speech (POS) Tagging introduced…☆11Updated 3 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆191Updated 2 months ago
- Vietnamese Text Dataset - Wikipedia vi 2018☆14Updated 6 years ago
- Tools for normalizing the use of some characters and checking file consistencies☆11Updated 8 months ago
- preprocessing and postediting tools especially for NLP (bash, perl, python)☆17Updated 2 months ago
- Morphological processing for languages of the Horn of Africa☆46Updated last week
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated 4 months ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11Updated 3 years ago