Socret360 / joint-khmer-word-segmentation-and-pos-tagging
A Keras implementation of a deep learning network to simultaneously perform Word Segmentation and Part-of-Speech (POS) Tagging introduced by Bouy et al. in the paper Joint Khmer Word Segmentation and Part-of-Speech Tagging Using Deep Learning.
☆11Updated 2 years ago
Alternatives and similar repositories for joint-khmer-word-segmentation-and-pos-tagging:
Users that are interested in joint-khmer-word-segmentation-and-pos-tagging are comparing it to the libraries listed below
- Khmer language processing toolkit☆68Updated last year
- A large collection of Khmer language resources. Khmer is a language used by Cambodia.☆101Updated 3 months ago
- ☆13Updated 5 years ago
- khPOS (Khmer Part-of-Speech) Corpus for Khmer NLP Research and Developments☆24Updated 10 months ago
- Khmer unicode text data for unsupervised learning language model☆21Updated 3 years ago
- Word segmentation using Conditional Random Fields (CRF) for Khmer document☆28Updated 4 years ago
- Machine Reading Comprehension special for the Vietnamese language☆39Updated 2 years ago
- New and modern Khmer keyboard with new re-design layout and local word segmentation☆22Updated 9 months ago
- The English-Vietnamese Bilingual Corpus (EVBCorpus) is a collection of English and Vietnamese parallel translations and bitexts.☆42Updated 5 years ago
- Khmer wordlist for line and word breaking☆36Updated 3 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆93Updated 3 years ago
- ☆62Updated last year
- ☆16Updated 2 years ago
- A Robustly Optimized BERT Pretraining Approach for Vietnamese☆31Updated 5 months ago
- ViSen is library to format tone of Vietnamese sentences☆19Updated 3 years ago
- Electra pre-trained model using Vietnamese corpus☆66Updated last year
- A dataset for Vietnamese Spelling Correction☆15Updated 3 years ago
- End-to-end Multi-task Solutions for Aspect Category Sentiment Analysis (ACSA) on Vietnamese Datasets☆22Updated 6 months ago
- Vietnamese handwritten text recognition system☆17Updated 3 years ago
- TUFS Asian Language Parallel Corpus☆50Updated last year
- NTREX -- News Test References for MT Evaluation☆80Updated 7 months ago
- This repo aims to build a web app that supports speech recognition system It's simple to use and understand☆38Updated last year
- BERT-based joint intent detection and slot filling with intent-slot attention mechanism (INTERSPEECH 2021)☆85Updated 5 months ago
- Xây dựng tập dữ liệu 500GB (20% done) văn bản tiếng Việt để huấn luyện mô hình ngôn ngữ lớn☆25Updated last year
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆36Updated last year
- ☆15Updated 6 months ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Updated 3 years ago
- An ensemble system with a search engine for relevant document retrieval and a deep learning model (BERT) for machine comprehension in Vie…☆13Updated 5 years ago
- This is a repository for PALM students at Royal University of Phnom Penh (2024). The materials for this course are adopted from https://i…☆14Updated last month
- A Lexical Normalization Corpus for Vietnamese Social Media Text☆12Updated 9 months ago