phylypo / khmer-text-dataLinks
Khmer unicode text data for unsupervised learning language model
☆25Updated 4 years ago
Alternatives and similar repositories for khmer-text-data
Users that are interested in khmer-text-data are comparing it to the libraries listed below
Sorting:
- Khmer language processing toolkit☆77Updated 2 years ago
- A large collection of Khmer language resources. Khmer is a language used by Cambodia.☆149Updated last week
- ☆44Updated 3 years ago
- khPOS (Khmer Part-of-Speech) Corpus for Khmer NLP Research and Developments☆33Updated last year
- A Keras implementation of a deep learning network to simultaneously perform Word Segmentation and Part-of-Speech (POS) Tagging introduced…☆11Updated 3 years ago
- From identity card image, this repo detect 4 corners, align by OpenCV, then detect word in image and recognize word by Transformer OCR.☆149Updated 2 years ago
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆113Updated 2 years ago
- Vietnamese Automatic Speech Recognition☆70Updated 6 years ago
- Vietnamese Optical Character Recognition. It works with Vietnamese and Latin characters as well.☆73Updated 7 years ago
- Vietnamese song lyric alignment framework☆68Updated 2 years ago
- Vietnamese self-supervised Wav2vec2 model☆61Updated 2 years ago
- Python library for Myanmar language☆37Updated last year
- End-to-end Multi-task Solutions for Aspect Category Sentiment Analysis (ACSA) on Vietnamese reviews, using PhoBERT as pretrained model☆29Updated last year
- Recognize and extract information from ID Card VietNam☆24Updated 2 years ago
- Machine Learning Project to identify an ID Card on an image☆61Updated last year
- This repo builds an end-to-end deep learning application that supports speech recognition system. It's simple to use and understand☆38Updated 2 years ago
- This repository is to create tflite models for the available ocr models☆107Updated 4 years ago
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…☆220Updated 9 months ago
- ☆97Updated 2 years ago
- Translate large dataset to any language with google translation api and multithreads processing, no key required!☆72Updated last year
- VietASR - Vietnamese Automatic Speech Recognition☆152Updated 11 months ago
- VNOnDB dataset extractor. This dataset can be use for build deep learning model to attack vietnamese handwritten text recognition problem…☆17Updated 4 years ago
- ☆14Updated 6 years ago
- Autonomous car source code for FPT Digital Race (Cuộc Đua Số) 2020 - Round 1 - Simulation Car. TOP 1 DHBKHN (HUST).☆12Updated 3 years ago
- This repo consists of the code as discussed in the Medium blog.☆16Updated 2 years ago
- eKYC (Electronic Know Your Customer) is a project designed to electronically verify the identity of customers☆45Updated 10 months ago
- UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)☆59Updated last year
- Solution for MC_OCR competition☆95Updated 2 years ago
- Comparison-of-OCR (KerasOCR, PyTesseract,EasyOCR)☆63Updated 3 years ago
- An web application helps us to extract information from Vietnamese chip-based ID card in a second. This application aims to reduce human …☆35Updated 2 years ago