PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)
☆51 · Jun 3, 2025 · Updated last year
Alternatives and similar repositories for PhoMT
Users that are interested in PhoMT are comparing it to the libraries listed below.
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022) ☆25 · Jun 5, 2025 · Updated last year
- PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021) ☆149 · Dec 31, 2024 · Updated last year
- ViText2SQL: A dataset for Vietnamese Text-to-SQL semantic parsing (EMNLP-2020 Findings) ☆37 · Jul 22, 2024 · Updated last year
- ☆25 · Aug 28, 2024 · Updated last year
- PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings) ☆788 · Jul 23, 2024 · Updated last year
- COVID-19 Named Entity Recognition for Vietnamese (NAACL 2021) ☆73 · Jul 22, 2024 · Updated last year
- MTet: Multi-domain Translation for English and Vietnamese ☆197 · Feb 7, 2023 · Updated 3 years ago
- VnDT: A Vietnamese Dependency Treebank ☆24 · Nov 6, 2021 · Updated 4 years ago
- BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022) ☆104 · Jul 22, 2024 · Updated last year
- WNUT-2020 Task 2: Identification of informative COVID-19 English Tweets ☆30 · Jul 22, 2024 · Updated last year
- Distributional Sliced-Wasserstein distance code ☆51 · Jul 22, 2024 · Updated last year
- Predictive Coding for Locally-Linear Control (ICML-2020) ☆18 · Jul 22, 2024 · Updated last year
- Graph Neural Networks for Knowledge Graph Link Prediction (WSDM 2022) (Pytorch) ☆61 · Dec 25, 2021 · Updated 4 years ago
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project ☆16 · Oct 28, 2022 · Updated 3 years ago
- Pre-trained Word2Vec syllable- and word-level embeddings for Vietnamese ☆55 · Aug 8, 2023 · Updated 2 years ago
- ☆17 · Jul 10, 2022 · Updated 3 years ago
- A trained neural network built from scratch to classify clothes and digits from the MNIST dataset ☆14 · Sep 11, 2019 · Updated 6 years ago
- BERT-based joint intent detection and slot filling with intent-slot attention mechanism (INTERSPEECH 2021) ☆88 · Jul 22, 2024 · Updated last year
- 31st place solution for Kaggle's Humpback Whale Identification challenge ☆17 · Mar 1, 2019 · Updated 7 years ago
- Applies the PhoBERT model by VinAI Research to the Vietnamese NER task on various datasets ☆21 · Jun 30, 2022 · Updated 3 years ago
- Source code for Zalo AI 2021 submission ☆141 · Dec 20, 2021 · Updated 4 years ago
- Data and code for our paper "Exploring and Predicting Transferability across NLP Tasks", to appear at EMNLP 2020. ☆48 · Mar 8, 2021 · Updated 5 years ago
- A collection for AI Engineer ☆41 · Jul 5, 2025 · Updated 11 months ago
- ☆51 · Sep 3, 2025 · Updated 9 months ago
- PhoGPT: Generative Pre-training for Vietnamese (2023) ☆792 · Nov 12, 2024 · Updated last year
- We provide benchmark datasets for evaluating Vietnamese processing models: UIT-ViQuAD, ViNewsQA, UIT-VSFC, UIT-ViIC, UIT-ViNames, UIT-VSM… ☆24 · Jun 19, 2021 · Updated 5 years ago
- Sentiment classification for Vietnamese text using PhoBERT ☆99 · Nov 16, 2020 · Updated 5 years ago
- Korean Speech to English Translation Corpus ☆45 · Sep 3, 2021 · Updated 4 years ago
- Official Code for ICML 2023 Data-Efficient Contrastive Self-supervised Learning ☆33 · Apr 16, 2024 · Updated 2 years ago
- Influpaint: Inpainting denoising diffusion probabilistic models for infectious disease (such as influenza) forecasting ☆22 · Jun 4, 2026 · Updated 2 weeks ago
- ☆16 · Jan 28, 2024 · Updated 2 years ago
- Building a 500 GB dataset (20% done) of Vietnamese text for training large language models ☆29 · Apr 7, 2023 · Updated 3 years ago
- Python scripts for crawling original images from Google Images ☆24 · May 5, 2022 · Updated 4 years ago
- ☆13 · Jan 16, 2025 · Updated last year
- Vietnamese Text to Speech library ☆258 · Aug 20, 2023 · Updated 2 years ago
- Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for t… ☆375 · Sep 5, 2022 · Updated 3 years ago
- The English-Vietnamese Bilingual Corpus (EVBCorpus) is a collection of English and Vietnamese parallel translations and bitexts. ☆52 · Jul 12, 2019 · Updated 6 years ago
- Vietnamese corpus ☆384 · Oct 3, 2025 · Updated 8 months ago
- Dictionary-guided Scene Text Recognition (CVPR-2021) ☆152 · Jul 23, 2024 · Updated last year