phylypo / khmer-text-data
Khmer unicode text data for unsupervised learning language model
☆21Updated 3 years ago
Alternatives and similar repositories for khmer-text-data:
Users that are interested in khmer-text-data are comparing it to the libraries listed below
- A large collection of Khmer language resources. Khmer is a language used by Cambodia.☆101Updated 3 months ago
- Khmer language processing toolkit☆68Updated last year
- A Keras implementation of a deep learning network to simultaneously perform Word Segmentation and Part-of-Speech (POS) Tagging introduced…☆11Updated 2 years ago
- khPOS (Khmer Part-of-Speech) Corpus for Khmer NLP Research and Developments☆24Updated 10 months ago
- ☆13Updated 5 years ago
- New and modern Khmer keyboard with new re-design layout and local word segmentation☆22Updated 9 months ago
- Separate Khmer words from given sentences.☆12Updated 4 years ago
- Python library for Myanmar language☆33Updated 11 months ago
- Khmer wordlist for line and word breaking☆36Updated 3 years ago
- This is a repository for PALM students at Royal University of Phnom Penh (2024). The materials for this course are adopted from https://i…☆14Updated last month
- UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)☆45Updated 3 months ago
- From identity card image, this repo detect 4 corners, align by OpenCV, then detect word in image and recognize word by Transformer OCR.☆144Updated 2 years ago
- ☆94Updated 2 years ago
- This repo aims to build a web app that supports speech recognition system It's simple to use and understand☆38Updated last year
- A standardized benchmark dataset for Khmer Optical Character Recognition (OCR) engine.☆19Updated 2 years ago
- Quy Nhon AI Hackathon 2022 - Challenge 2: Review Analytics - Top 1 Solution☆10Updated 2 years ago
- This repository is to create tflite models for the available ocr models☆103Updated 3 years ago
- English-Thai Machine Translation Models☆28Updated 8 months ago
- Recognize and extract information from ID Card VietNam☆26Updated last year
- An web application helps us to extract information from Vietnamese chip-based ID card in a second. This application aims to reduce human …☆32Updated last year
- End-to-end Multi-task Solutions for Aspect Category Sentiment Analysis (ACSA) on Vietnamese Datasets☆22Updated 6 months ago
- A python-based algorithm for id-card rectification☆45Updated 4 months ago
- Vietnamese self-supervised Wav2vec2 model☆61Updated 2 years ago
- preprocessing and postediting tools especially for NLP (bash, perl, python)☆16Updated last month
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages☆10Updated 5 months ago
- RASA based voice bot after 1 months jump in to AI ;)☆29Updated 5 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆93Updated 3 years ago
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆51Updated 2 weeks ago
- In this repo, I use encoder, decoder with attention mechanism to auto-correct output of vietnamese ocr model☆25Updated 3 years ago
- POS for African languages☆17Updated 11 months ago