high performance tokenizer for Vietnamese language
☆408Apr 15, 2021Updated 4 years ago
Alternatives and similar repositories for coccoc-tokenizer
Users that are interested in coccoc-tokenizer are comparing it to the libraries listed below
Sorting:
- Vietnamese Analysis Plugin for Elasticsearch☆554Dec 19, 2025Updated 2 months ago
- Underthesea - Vietnamese NLP Toolkit☆1,675Feb 25, 2026Updated last week
- A Vietnamese natural language processing toolkit (NAACL 2018)☆659Feb 12, 2023Updated 3 years ago
- A Vietnamese Text Processing Toolkit☆217Jan 27, 2022Updated 4 years ago
- Submission for AIviVN Vietnamese diacritics restoration contest https://www.aivivn.com/contests/3☆40Jul 25, 2024Updated last year
- Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for t…☆370Sep 5, 2022Updated 3 years ago
- Thư viện chuẩn hóa văn bản Tiếng Việt☆180May 26, 2025Updated 9 months ago
- ALBERT for Vietnamese☆96Dec 16, 2019Updated 6 years ago
- Corpus tiếng việt☆385Oct 3, 2025Updated 5 months ago
- PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)☆773Jul 23, 2024Updated last year
- Vietnamese NLP Toolkit for Node☆218Feb 26, 2024Updated 2 years ago
- Solution for AIviVN's Vietnamese tone prediction competition☆35Jun 8, 2019Updated 6 years ago
- Python Vietnamese Core NLP Toolkit☆272Sep 26, 2024Updated last year
- ETNLP: A toolkit to evaluate, extract, and visualize multiple embeddings☆149Aug 23, 2025Updated 6 months ago
- Submission for AIviVN sentiment analysis contest https://www.aivivn.com/contests/1☆15Oct 12, 2021Updated 4 years ago
- Công cụ quét và phân tích từ khoá các trang báo mạng Việt Nam☆266May 22, 2023Updated 2 years ago
- A Fast and Accurate Vietnamese Word Segmenter (LREC 2018)☆83Nov 23, 2022Updated 3 years ago
- Một cuốn sách tập trung vào hướng dẫn cách cấu trúc các dự án Học Máy và phân tích cách làm cho các thuật toán Học Máy hoạt động.☆1,085Oct 13, 2021Updated 4 years ago
- Pre-trained Word2Vec models for Vietnamese☆160Dec 30, 2020Updated 5 years ago
- PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)☆150Dec 31, 2024Updated last year
- Zalo landmark identification challenge, 103 classes, > 100k images (PyTorch)☆172Sep 1, 2019Updated 6 years ago
- LSTM model for Vietnamese Named Entity Recognition☆17Jul 26, 2017Updated 8 years ago
- Electra pre-trained model using Vietnamese corpus☆67Jun 12, 2023Updated 2 years ago
- A Large-scale Vietnamese News Text Classification Corpus☆109Sep 24, 2019Updated 6 years ago
- Vietnamese stopwords☆197Jul 25, 2022Updated 3 years ago
- Vietnamese song lyric alignment framework☆68Dec 11, 2022Updated 3 years ago
- Những kiến thức cần thiết để học tốt Machine Learning trong vòng 2 tháng. Essential Knowledge for learning Machine Learning in two months…☆2,140Oct 21, 2022Updated 3 years ago
- Transformer OCR☆752Jan 19, 2025Updated last year
- Vietnamese Named Entity Recognition☆52Dec 8, 2022Updated 3 years ago
- MTet: Multi-domain Translation for English and Vietnamese☆193Feb 7, 2023Updated 3 years ago
- Một cuốn sách về Học Sâu đề cập đến nhiều framework phổ biến, được sử dụng trên 300 trường Đại học từ 55 đất nước bao gồm MIT, Stanford, …☆653Jul 8, 2022Updated 3 years ago
- A simple/fast/accurate accent prediction for non-accented Vietnamese text☆35Oct 20, 2017Updated 8 years ago
- A toolkit for Vietnamese word segmentation☆74Oct 20, 2022Updated 3 years ago
- vietnamese OCR☆139Apr 28, 2019Updated 6 years ago
- Vietnamese Chatbot☆96Apr 17, 2024Updated last year
- Vietnamese Analysis Plugin for OpenSearch☆11Feb 20, 2025Updated last year
- Sentiment classification for Vietnamese text using PhoBert☆99Nov 16, 2020Updated 5 years ago
- A toolkit for processing Vietnamese texts☆16Oct 20, 2022Updated 3 years ago
- Vietnamese question answering system with BERT☆117Jan 12, 2023Updated 3 years ago