phylypo / khmer-text-data
Khmer Unicode text data for unsupervised learning of language models
☆23Updated 4 years ago
Alternatives and similar repositories for khmer-text-data
Users that are interested in khmer-text-data are comparing it to the libraries listed below
- Khmer language processing toolkit☆72Updated last year
- A Keras implementation of a deep learning network to simultaneously perform Word Segmentation and Part-of-Speech (POS) Tagging introduced…☆11Updated 3 years ago
- khPOS (Khmer Part-of-Speech) Corpus for Khmer NLP Research and Developments☆29Updated last year
- A large collection of Khmer language resources. Khmer is the language spoken in Cambodia.☆135Updated last month
- Word segmentation using Conditional Random Fields (CRF) for Khmer documents☆30Updated 5 years ago
- ☆14Updated 6 years ago
- ☆43Updated 3 years ago
- Khmer wordlist for line and word breaking☆36Updated 3 years ago
- ☆97Updated 2 years ago
- This repo aims to build a web app that supports a speech recognition system. It's simple to use and understand.☆38Updated 2 years ago
- Python library for Myanmar language☆35Updated last year
- Recognize and extract information from Vietnamese ID cards☆25Updated last year
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆99Updated 3 years ago
- This is a repository for PALM students at Royal University of Phnom Penh (2024). The materials for this course are adopted from https://i…☆16Updated 2 months ago
- Vietnamese self-supervised Wav2vec2 model☆62Updated 2 years ago
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆60Updated 5 months ago
- Given an identity card image, this repo detects the 4 corners, aligns the card with OpenCV, then detects words in the image and recognizes them with Transformer OCR.☆149Updated 2 years ago
- A synthesized dataset for Vietnamese TTS task☆63Updated 3 years ago
- Jupyter notebooks to fine-tune Whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2☆15Updated 2 months ago
- Large language model for Vietnamese people☆61Updated last year
- A TensorFlow implementation of a hybrid CNN-LSTM model with CTC loss for the OCR problem☆32Updated 6 years ago
- A web application that helps extract information from Vietnamese chip-based ID cards in a second. This application aims to reduce human …☆34Updated last year
- Machine Learning Project to identify an ID Card on an image☆58Updated 11 months ago
- ☆26Updated last year
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated last year
- Vietnamese Optical Character Recognition. It works with Vietnamese and Latin characters as well.☆73Updated 6 years ago
- VietASR - Vietnamese Automatic Speech Recognition☆134Updated 8 months ago
- Anyone can build their own chatbot through instruction tuning, with a 12 GB GPU (RTX 3060) and a few dozen MB of data☆113Updated 2 years ago
- Fine-tuning Vietnamese Text-to-speech model (VITS)☆46Updated 3 months ago
- BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)☆105Updated 11 months ago