high performance tokenizer for Vietnamese language
☆409Apr 15, 2021Updated 4 years ago
Alternatives and similar repositories for coccoc-tokenizer
Users that are interested in coccoc-tokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Vietnamese Analysis Plugin for Elasticsearch☆557Mar 15, 2026Updated last week
- Underthesea - Vietnamese NLP Toolkit☆1,687Mar 15, 2026Updated last week
- A Vietnamese natural language processing toolkit (NAACL 2018)☆662Feb 12, 2023Updated 3 years ago
- A Vietnamese Text Processing Toolkit☆217Jan 27, 2022Updated 4 years ago
- Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for t…☆371Sep 5, 2022Updated 3 years ago
- Thư viện chuẩn hóa văn bản Tiếng Việt☆180May 26, 2025Updated 9 months ago
- Submission for AIviVN Vietnamese diacritics restoration contest https://www.aivivn.com/contests/3☆40Jul 25, 2024Updated last year
- Solution for AIviVN's Vietnamese tone prediction competition☆35Jun 8, 2019Updated 6 years ago
- Python Vietnamese Core NLP Toolkit☆272Sep 26, 2024Updated last year
- ALBERT for Vietnamese☆96Dec 16, 2019Updated 6 years ago
- Corpus tiếng việt☆385Oct 3, 2025Updated 5 months ago
- Vietnamese NLP Toolkit for Node☆218Feb 26, 2024Updated 2 years ago
- ETNLP: A toolkit to evaluate, extract, and visualize multiple embeddings☆149Aug 23, 2025Updated 7 months ago
- Submission for AIviVN sentiment analysis contest https://www.aivivn.com/contests/1☆15Oct 12, 2021Updated 4 years ago
- PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)☆777Jul 23, 2024Updated last year
- Công cụ quét và phân tích từ khoá các trang báo mạng Việt Nam☆265May 22, 2023Updated 2 years ago
- M ột cuốn sách tập trung vào hướng dẫn cách cấu trúc các dự án Học Máy và phân tích cách làm cho các thuật toán Học Máy hoạt động.☆1,084Oct 13, 2021Updated 4 years ago
- A Fast and Accurate Vietnamese Word Segmenter (LREC 2018)☆83Nov 23, 2022Updated 3 years ago
- Pre-trained Word2Vec models for Vietnamese☆161Dec 30, 2020Updated 5 years ago
- Electra pre-trained model using Vietnamese corpus☆67Jun 12, 2023Updated 2 years ago
- Vietnamese stopwords☆197Jul 25, 2022Updated 3 years ago
- Zalo landmark identification challenge, 103 classes, > 100k images (PyTorch)☆172Sep 1, 2019Updated 6 years ago
- LSTM model for Vietnamese Named Entity Recognition☆17Jul 26, 2017Updated 8 years ago
- A simple/fast/accurate accent prediction for non-accented Vietnamese text☆35Oct 20, 2017Updated 8 years ago
- A toolkit for processing Vietnamese texts☆16Oct 20, 2022Updated 3 years ago
- Những kiến thức cần thiết để học tốt Machine Learning trong vòng 2 tháng. Essential Knowledge for learning Machine Learning in two months…☆2,141Oct 21, 2022Updated 3 years ago
- ☆15Jun 21, 2022Updated 3 years ago
- A Large-scale Vietnamese News Text Classification Corpus☆108Sep 24, 2019Updated 6 years ago
- This project applies multiple deep learning models to the problem of restoring diacritical marks to sentences in Vietnamese.☆26Nov 13, 2018Updated 7 years ago
- ☆33Nov 11, 2013Updated 12 years ago
- A toolkit for Vietnamese word segmentation☆74Oct 20, 2022Updated 3 years ago
- Vietnamese song lyric alignment framework☆68Dec 11, 2022Updated 3 years ago
- Vietnamese Named Entity Recognition☆52Dec 8, 2022Updated 3 years ago
- Transformer OCR☆756Jan 19, 2025Updated last year
- Sentiment classification for Vietnamese text using PhoBert☆98Nov 16, 2020Updated 5 years ago
- vietnamese OCR☆140Apr 28, 2019Updated 6 years ago
- MTet: Multi-domain Translation for English and Vietnamese☆194Feb 7, 2023Updated 3 years ago
- Một cuốn sách về Học Sâu đề cập đến nhiều framework phổ biến, được sử dụng trên 300 trường Đại học từ 55 đất nước bao gồm MIT, Stanford, …☆657Jul 8, 2022Updated 3 years ago
- Vietnamese Chatbot☆96Apr 17, 2024Updated last year