phylypo / khmer-text-dataLinks
Khmer unicode text data for unsupervised learning language model
☆23Updated 4 years ago
Alternatives and similar repositories for khmer-text-data
Users that are interested in khmer-text-data are comparing it to the libraries listed below
Sorting:
- Khmer language processing toolkit☆75Updated last year
- A large collection of Khmer language resources. Khmer is a language used by Cambodia.☆138Updated this week
- khPOS (Khmer Part-of-Speech) Corpus for Khmer NLP Research and Developments☆30Updated last year
- Python library for Myanmar language☆36Updated last year
- A Keras implementation of a deep learning network to simultaneously perform Word Segmentation and Part-of-Speech (POS) Tagging introduced…☆11Updated 3 years ago
- Speech Emotion Recognition using PyTorch sponsored by AIS and VISTEC-DEPA AIResearch Institute Thailand.☆21Updated 3 years ago
- UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)☆55Updated 9 months ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆99Updated 3 years ago
- This repo aims to build a web app that supports speech recognition system It's simple to use and understand☆38Updated 2 years ago
- A fine-tuned Large Language Model (LLM) for the Vietnamese language based on the Llama 2 model.☆16Updated last year
- This repository is to create tflite models for the available ocr models☆105Updated 4 years ago
- ☆43Updated 3 years ago
- Vietnamese Optical Character Recognition. It works with Vietnamese and Latin characters as well.☆73Updated 6 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated last year
- OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes☆67Updated 2 weeks ago
- Open source speech to text models for Indic Languages☆306Updated 2 years ago
- This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat…☆26Updated 6 years ago
- A synthesized dataset for Vietnamese TTS task☆63Updated 3 years ago
- Finetune wav2vec2-large-xlsr-53 with Thai Common Voice Corpus 7.0☆49Updated 3 years ago
- Vietnamese song lyric alignment framework☆67Updated 2 years ago
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆113Updated 2 years ago
- The official implementation of CATT Arabic diacritization models.☆46Updated this week
- Vietnamese self-supervised Wav2vec2 model☆62Updated 2 years ago
- VietASR - Vietnamese Automatic Speech Recognition☆135Updated 8 months ago
- ☆14Updated 6 years ago
- A TensorFlow implementation of hybird CNN-LSTM model with CTC loss for OCR problem☆32Updated 6 years ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆319Updated 2 years ago
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…☆208Updated 6 months ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆87Updated 2 years ago
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆61Updated 6 months ago