phylypo / khmer-text-data
Khmer Unicode text data for unsupervised learning of language models
☆25 · Updated 4 years ago
Alternatives and similar repositories for khmer-text-data
Users who are interested in khmer-text-data are comparing it to the repositories listed below.
- Word segmentation using Conditional Random Fields (CRF) for Khmer documents ☆30 · Updated 5 years ago
- A Keras implementation of a deep learning network to simultaneously perform Word Segmentation and Part-of-Speech (POS) Tagging introduced… ☆11 · Updated 3 years ago
- ☆43 · Updated 3 years ago
- Vietnamese Optical Character Recognition. It works with Vietnamese and Latin characters as well. ☆73 · Updated 7 years ago
- Python library for Myanmar language ☆37 · Updated last year
- khPOS (Khmer Part-of-Speech) Corpus for Khmer NLP Research and Developments ☆31 · Updated last year
- This repository creates TFLite models for the available OCR models ☆107 · Updated 4 years ago
- Speech Emotion Recognition using PyTorch sponsored by AIS and VISTEC-DEPA AIResearch Institute Thailand. ☆21 · Updated 3 years ago
- Given an identity card image, this repo detects the 4 corners, aligns the card with OpenCV, then detects words in the image and recognizes them with Transformer OCR. ☆149 · Updated 2 years ago
- Machine Learning Project to identify an ID Card in an image ☆61 · Updated last year
- ☆14 · Updated 6 years ago
- This repository contains all the code used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat… ☆26 · Updated 6 years ago
- OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes ☆70 · Updated 2 months ago
- This repo aims to build a web application that supports a speech recognition system. It's simple to use and understand. ☆38 · Updated 2 years ago
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte… ☆213 · Updated 7 months ago
- Open source speech to text models for Indic Languages ☆307 · Updated 2 years ago
- Python demo for ID card digitization using Nanonets ☆26 · Updated 5 years ago
- ☆49 · Updated 2 years ago
- UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23) ☆56 · Updated 10 months ago
- ☆254 · Updated last year
- Template based form extractor OCR. Train your own character and alphabet OCR. ☆18 · Updated 6 years ago
- eKYC (Electronic Know Your Customer) is a project designed to electronically verify the identity of customers ☆45 · Updated 8 months ago
- TableNet: Deep Learning model for end-to-end Table Detection and Tabular data extraction from Scanned Data Images. In modern times, more a… ☆60 · Updated 3 years ago
- A TensorFlow implementation of a hybrid CNN-LSTM model with CTC loss for the OCR problem ☆33 · Updated 6 years ago
- Research papers and code on information extraction from image/pdf ☆97 · Updated 2 years ago
- VNOnDB dataset extractor. This dataset can be used to build deep learning models to tackle the Vietnamese handwritten text recognition problem… ☆17 · Updated 3 years ago
- ☆62 · Updated 4 years ago
- ☆34 · Updated 5 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM ☆38 · Updated last year
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0 ☆100 · Updated 3 years ago