Data Augmentation by Backtranslation (DAB) ヽ( •_-)ᕗ
☆69Jun 20, 2022Updated 3 years ago
Alternatives and similar repositories for dab
Users that are interested in dab are comparing it to the libraries listed below
Sorting:
- Submission for AIviVN Vietnamese diacritics restoration contest https://www.aivivn.com/contests/3☆40Jul 25, 2024Updated last year
- ALBERT for Vietnamese☆96Dec 16, 2019Updated 6 years ago
- Electra pre-trained model using Vietnamese corpus☆67Jun 12, 2023Updated 2 years ago
- Pre-trained Word2Vec models for Vietnamese☆160Dec 30, 2020Updated 5 years ago
- MTet: Multi-domain Translation for English and Vietnamese☆193Feb 7, 2023Updated 3 years ago
- Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for t…☆370Sep 5, 2022Updated 3 years ago
- Repository for our ICLR 2019 paper: Discovery of Natural Language Concepts in Individual Units of CNNs☆26Mar 9, 2019Updated 6 years ago
- PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)☆773Jul 23, 2024Updated last year
- A dataset for Vietnamese Spelling Correction☆15Sep 27, 2021Updated 4 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,926Feb 14, 2023Updated 3 years ago
- ☆48Dec 13, 2019Updated 6 years ago
- Our submission for Aivivn Contest 1.☆30Mar 16, 2019Updated 6 years ago
- Vietnamese question answering system with BERT☆117Jan 12, 2023Updated 3 years ago
- Corpus tiếng việt☆385Oct 3, 2025Updated 5 months ago
- ☆15Mar 19, 2020Updated 5 years ago
- 1st place solution for Zalo AI 2019 - Vietnamese Wiki Question Answering☆48Dec 11, 2019Updated 6 years ago
- Unsupervised Data Augmentation (UDA)☆2,204Aug 28, 2021Updated 4 years ago
- PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)☆150Dec 31, 2024Updated last year
- Thư viện chuẩn hóa văn bản Tiếng Việt☆180May 26, 2025Updated 9 months ago
- dentifying gender and regional accent from speech☆37Aug 21, 2018Updated 7 years ago
- All the ML algorithms, ML models are coded from scratch by pure Python/Numpy with the Math under the hood. It works well on CPU.☆222May 8, 2023Updated 2 years ago
- Một cuốn sách về Học Sâu đề cập đến nhiều framework phổ biến, được sử dụng trên 300 trường Đại học từ 55 đất nước bao gồm MIT, Stanford, …☆653Jul 8, 2022Updated 3 years ago
- Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing☆790Jul 22, 2025Updated 7 months ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- Machine Reading Comprehension special for the Vietnamese language☆41Mar 13, 2022Updated 3 years ago
- Speech and Language Processing 3rd edition Vietnamese Translation☆25Nov 1, 2018Updated 7 years ago
- Sentence Embeddings with BERT & XLNet☆27Aug 23, 2020Updated 5 years ago
- A Vietnamese natural language processing toolkit (NAACL 2018)☆659Feb 12, 2023Updated 3 years ago
- Vietnamese song lyric alignment framework☆68Dec 11, 2022Updated 3 years ago
- A toolkit for evaluating the linguistic knowledge and transferability of contextual representations. Code for "Linguistic Knowledge and T…☆210Oct 20, 2021Updated 4 years ago
- ETNLP: A toolkit to evaluate, extract, and visualize multiple embeddings☆149Aug 23, 2025Updated 6 months ago
- Neural Text Generation with Unlikelihood Training☆310Aug 31, 2021Updated 4 years ago
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,123Apr 20, 2022Updated 3 years ago
- Zalo AI Challenge - Landmark Identification☆40Jul 30, 2024Updated last year
- NLP library designed for reproducible experimentation management☆294Jul 25, 2024Updated last year
- Sentiment classification for Vietnamese text using PhoBert☆99Nov 16, 2020Updated 5 years ago
- Repository of code for the tutorial on Transfer Learning in NLP held at NAACL 2019 in Minneapolis, MN, USA☆722Oct 16, 2019Updated 6 years ago
- jiant is an nlp toolkit☆1,674Jul 6, 2023Updated 2 years ago
- New dataset☆311Aug 31, 2021Updated 4 years ago