Socret360 / joint-khmer-word-segmentation-and-pos-tagging
A Keras implementation of a deep learning network that simultaneously performs word segmentation and part-of-speech (POS) tagging, introduced by Buoy et al. in the paper "Joint Khmer Word Segmentation and Part-of-Speech Tagging Using Deep Learning".
☆11 · Updated 3 years ago
Alternatives and similar repositories for joint-khmer-word-segmentation-and-pos-tagging:
Users who are interested in joint-khmer-word-segmentation-and-pos-tagging are comparing it to the libraries listed below:
- Khmer language processing toolkit ☆72 · Updated last year
- Khmer unicode text data for unsupervised learning language model ☆21 · Updated 4 years ago
- ☆14 · Updated 6 years ago
- A large collection of Khmer language resources. Khmer is a language used in Cambodia. ☆110 · Updated 7 months ago
- khPOS (Khmer Part-of-Speech) Corpus for Khmer NLP Research and Developments ☆26 · Updated last year
- Word segmentation using Conditional Random Fields (CRF) for Khmer documents ☆29 · Updated 4 years ago
- The English-Vietnamese Bilingual Corpus (EVBCorpus) is a collection of English and Vietnamese parallel translations and bitexts. ☆42 · Updated 5 years ago
- TUFS Asian Language Parallel Corpus ☆50 · Updated last year
- New and modern Khmer keyboard with a redesigned layout and local word segmentation ☆23 · Updated last year
- Machine Reading Comprehension specifically for the Vietnamese language ☆40 · Updated 3 years ago
- PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021) ☆141 · Updated 3 months ago
- A dataset for Vietnamese Spelling Correction ☆15 · Updated 3 years ago
- ☆16 · Updated 2 years ago
- ☆68 · Updated 2 years ago
- A Robustly Optimized BERT Pretraining Approach for Vietnamese ☆32 · Updated 9 months ago
- ViSen is a library to format the tone of Vietnamese sentences ☆20 · Updated 3 years ago
- ☆25 · Updated 7 months ago
- ☆13 · Updated 2 years ago
- Vietnamese handwritten text recognition system ☆17 · Updated 3 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0 ☆98 · Updated 3 years ago
- Fast Punctuation Restoration using Transformer Models for Vietnamese ☆10 · Updated 2 years ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model. ☆22 · Updated 9 months ago
- Electra pre-trained model using Vietnamese corpus ☆66 · Updated last year
- Transformation of spoken text to written text ☆30 · Updated 11 months ago
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages ☆12 · Updated 8 months ago
- Code, models, and data for "Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation". EMNLP 2023. ☆16 · Updated 7 months ago
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023 ☆56 · Updated last year
- PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021) ☆42 · Updated 9 months ago
- Khmer wordlist for line and word breaking ☆36 · Updated 3 years ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback ☆94 · Updated last year