VinAIResearch / PhoBERTView external linksLinks
PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)
☆771Jul 23, 2024Updated last year
Alternatives and similar repositories for PhoBERT
Users that are interested in PhoBERT are comparing it to the libraries listed below
Sorting:
- A Vietnamese natural language processing toolkit (NAACL 2018)☆657Feb 12, 2023Updated 3 years ago
- Corpus tiếng việt☆384Oct 3, 2025Updated 4 months ago
- Sentiment classification for Vietnamese text using PhoBert☆99Nov 16, 2020Updated 5 years ago
- PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)☆150Dec 31, 2024Updated last year
- Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for t…☆369Sep 5, 2022Updated 3 years ago
- Underthesea - Vietnamese NLP Toolkit☆1,667Updated this week
- BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)☆103Jul 22, 2024Updated last year
- PhoGPT: Generative Pre-training for Vietnamese (2023)☆797Nov 12, 2024Updated last year
- Transformer OCR☆749Jan 19, 2025Updated last year
- Electra pre-trained model using Vietnamese corpus☆67Jun 12, 2023Updated 2 years ago
- MTet: Multi-domain Translation for English and Vietnamese☆192Feb 7, 2023Updated 3 years ago
- Vietnamese question answering system with BERT☆117Jan 12, 2023Updated 3 years ago
- A Robustly Optimized BERT Pretraining Approach for Vietnamese☆32Jul 25, 2024Updated last year
- Một cuốn sách về Học Sâu đề cập đến nhiều framework phổ biến, được sử dụng trên 300 trường Đại học từ 55 đất nước bao gồm MIT, Stanford, …☆653Jul 8, 2022Updated 3 years ago
- A Fast and Accurate Vietnamese Word Segmenter (LREC 2018)☆83Nov 23, 2022Updated 3 years ago
- PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)☆48Jun 3, 2025Updated 8 months ago
- high performance tokenizer for Vietnamese language☆406Apr 15, 2021Updated 4 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆105Sep 3, 2021Updated 4 years ago
- A Python wrapper for VnCoreNLP using a bidirectional communication channel.☆58Aug 10, 2018Updated 7 years ago
- Thư viện chuẩn hóa văn bản Tiếng Việt☆180May 26, 2025Updated 8 months ago
- Submission for AIviVN Vietnamese diacritics restoration contest https://www.aivivn.com/contests/3☆40Jul 25, 2024Updated last year
- A collection of Vietnamese Natural Language Processing resources.☆305Oct 28, 2025Updated 3 months ago
- COVID-19 Named Entity Recognition for Vietnamese (NAACL 2021)☆72Jul 22, 2024Updated last year
- Vietnamese language model for spacy.io☆112Jul 14, 2023Updated 2 years ago
- A Large-scale Vietnamese News Text Classification Corpus☆108Sep 24, 2019Updated 6 years ago
- ETNLP: A toolkit to evaluate, extract, and visualize multiple embeddings☆149Aug 23, 2025Updated 5 months ago
- Vietnamese Named Entity Recognition☆52Dec 8, 2022Updated 3 years ago
- Những kiến thức cần thiết để học tốt Machine Learning trong vòng 2 tháng. Essential Knowledge for learning Machine Learning in two months…☆2,135Oct 21, 2022Updated 3 years ago
- Pre-trained Word2Vec syllable- and word-level embeddings for Vietnamese☆53Aug 8, 2023Updated 2 years ago
- Vietnamese song lyric alignment framework☆68Dec 11, 2022Updated 3 years ago
- Một cuốn sách tập trung vào hướng dẫn cách cấu trúc các dự án Học Máy và phân tích cách làm cho các thuật toán Học Máy hoạt động.☆1,083Oct 13, 2021Updated 4 years ago
- ebook Machine Learning cơ bản☆1,723Jul 5, 2024Updated last year
- ☆75Feb 6, 2023Updated 3 years ago
- VnDT: A Vietnamese Dependency Treebank☆24Nov 6, 2021Updated 4 years ago
- vietnamese OCR☆138Apr 28, 2019Updated 6 years ago
- Vietnamese self-supervised Wav2vec2 model☆61Nov 5, 2022Updated 3 years ago
- Sentence Embeddings with BERT & XLNet☆27Aug 23, 2020Updated 5 years ago
- Solution for MC_OCR competition☆95Mar 7, 2023Updated 2 years ago
- Multilingual bert retrained on news + squad2 for vietnamese☆24Feb 16, 2020Updated 5 years ago