phylypo / khmer-text-data
Khmer unicode text data for unsupervised learning language model
☆21Updated 4 years ago
Alternatives and similar repositories for khmer-text-data
Users that are interested in khmer-text-data are comparing it to the libraries listed below
Sorting:
- Khmer language processing toolkit☆72Updated last year
- A large collection of Khmer language resources. Khmer is a language used by Cambodia.☆114Updated last week
- khPOS (Khmer Part-of-Speech) Corpus for Khmer NLP Research and Developments☆26Updated last year
- Word segmentation using Conditional Random Fields (CRF) for Khmer document☆29Updated 4 years ago
- Vietnamese self-supervised Wav2vec2 model☆62Updated 2 years ago
- Vietnamese song lyric alignment framework☆68Updated 2 years ago
- ☆43Updated 3 years ago
- VietASR - Vietnamese Automatic Speech Recognition☆130Updated 6 months ago
- A synthesized dataset for Vietnamese TTS task☆63Updated 3 years ago
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆59Updated 4 months ago
- Scene text recognition vietnamese☆16Updated 4 years ago
- ☆69Updated 2 years ago
- PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)☆152Updated 6 months ago
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆113Updated last year
- This repo aims to build a web app that supports speech recognition system It's simple to use and understand☆38Updated last year
- Vietnamese Automatic Speech Recognition☆69Updated 6 years ago
- Solution for MC_OCR competition☆95Updated 2 years ago
- ☆96Updated 2 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆98Updated 3 years ago
- ☆26Updated last year
- From identity card image, this repo detect 4 corners, align by OpenCV, then detect word in image and recognize word by Transformer OCR.☆147Updated 2 years ago
- Phần mềm nguồn mở giúp mỗi cá nhân trực tiếp sử dụng ChatGPT và hơn thế nữa ngay trên máy tính c ủa mình.☆35Updated 2 years ago
- Separate Khmer words from given sentences.☆12Updated 5 years ago
- This is our solution dealing with BKAI challenge☆63Updated 2 years ago
- Solution for Zalo AI Challenge 2022 - Lyrics Alignment☆68Updated 2 years ago
- EraX-VL-7B-V1 is the multimodal large language model developed by EraX team, base on Qwen2-VL.☆11Updated 4 months ago
- Bud500: A Comprehensive Vietnamese ASR Dataset☆66Updated last year
- A Robustly Optimized BERT Pretraining Approach for Vietnamese☆32Updated 9 months ago
- Top 2 Solution for Zalo AI Challenge 2022 - Liveness Detection track☆43Updated 2 years ago
- ☆14Updated 6 years ago