PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)
☆48Jun 3, 2025Updated 9 months ago
Alternatives and similar repositories for PhoMT
Users who are interested in PhoMT are comparing it to the repositories listed below
- PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)☆150Dec 31, 2024Updated last year
- PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)☆773Jul 23, 2024Updated last year
- ViText2SQL: A dataset for Vietnamese Text-to-SQL semantic parsing (EMNLP-2020 Findings)☆36Jul 22, 2024Updated last year
- VnDT: A Vietnamese Dependency Treebank☆24Nov 6, 2021Updated 4 years ago
- ☆25Aug 28, 2024Updated last year
- MTet: Multi-domain Translation for English and Vietnamese☆193Feb 7, 2023Updated 3 years ago
- COVID-19 Named Entity Recognition for Vietnamese (NAACL 2021)☆72Jul 22, 2024Updated last year
- Coloring lips and drawing glasses on faces in custom images or live webcam☆11Sep 10, 2019Updated 6 years ago
- 🌸 A collection of Vietnamese women who are currently working in the field of Computer Science.☆13Updated this week
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆16Oct 28, 2022Updated 3 years ago
- BERT-based joint intent detection and slot filling with intent-slot attention mechanism (INTERSPEECH 2021)☆88Jul 22, 2024Updated last year
- Pre-trained Word2Vec syllable- and word-level embeddings for Vietnamese☆53Aug 8, 2023Updated 2 years ago
- WNUT-2020 Task 2: Identification of informative COVID-19 English Tweets☆30Jul 22, 2024Updated last year
- A trained neural network built from scratch to classify clothes and digits from the MNIST dataset☆14Sep 11, 2019Updated 6 years ago
- ☆17Jul 10, 2022Updated 3 years ago
- Vietnamese Text to Speech library☆255Aug 20, 2023Updated 2 years ago
- Source code for Zalo AI 2021 submission☆142Dec 20, 2021Updated 4 years ago
- BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)☆104Jul 22, 2024Updated last year
- This repo builds an end-to-end deep learning application that supports a speech recognition system. It is simple to use and understand☆38May 23, 2023Updated 2 years ago
- A collection for AI engineers☆41Jul 5, 2025Updated 8 months ago
- Sentiment classification for Vietnamese text using PhoBERT☆98Nov 16, 2020Updated 5 years ago
- We provide benchmark datasets for evaluating Vietnamese processing models: UIT-ViQuAD, ViNewsQA, UIT-VSFC, UIT-ViIC, UIT-ViNames, UIT-VSM…☆20Jun 19, 2021Updated 4 years ago
- Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for t…☆370Sep 5, 2022Updated 3 years ago
- Predictive Coding for Locally-Linear Control (ICML-2020)☆17Jul 22, 2024Updated last year
- A Vietnamese-English Neural Machine Translation System (INTERSPEECH 2022)☆136Jul 22, 2024Updated last year
- ☆23Nov 6, 2022Updated 3 years ago
- Pioneering in Vietnamese Multimodal Large Language Model☆52Jan 23, 2025Updated last year
- A Vietnamese natural language processing toolkit (NAACL 2018)☆659Feb 12, 2023Updated 3 years ago
- Applies the PhoBERT model by VinAI Research to the Vietnamese NER task on various datasets☆21Jun 30, 2022Updated 3 years ago
- PhoGPT: Generative Pre-training for Vietnamese (2023)☆798Nov 12, 2024Updated last year
- ☆53Aug 28, 2024Updated last year
- 31st place solution for Kaggle's Humpback Whale Identification challenge☆17Mar 1, 2019Updated 7 years ago
- Python scripts for crawling original images from Google Images☆24May 5, 2022Updated 3 years ago
- An attempt at Vietnamese speech enhancement with U-Net and a U-Net-based ResNet☆22Nov 6, 2021Updated 4 years ago
- Vietnamese corpus☆385Oct 3, 2025Updated 5 months ago
- ☆26Jul 30, 2024Updated last year
- Improving Elasticsearch for semantic search using Sentence Embeddings☆25May 27, 2021Updated 4 years ago
- Implementation for paper MLP-Mixer: An all-MLP Architecture for Vision☆91Aug 14, 2021Updated 4 years ago
- Building a 500GB (20% done) Vietnamese text dataset for training large language models☆29Apr 7, 2023Updated 2 years ago