keyreply / Thai-NLP-Dataset
43+ collections of Thai Natural Language Processing libraries. Updated daily.
☆29Updated 6 years ago
Alternatives and similar repositories for Thai-NLP-Dataset
Users who are interested in Thai-NLP-Dataset are comparing it to the libraries listed below
- Implementation of ConGen: Unsupervised Control and Generalization Distillation For Sentence Representation (Findings of EMNLP 2022).☆22Updated last year
- ☆39Updated 4 years ago
- Load What You Need: Smaller Multilingual Transformers for PyTorch and TensorFlow 2.0.☆103Updated 3 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated 2 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- Pretraining transformer-based Thai language models☆121Updated last year
- A tiny BERT for low-resource monolingual models☆31Updated 9 months ago
- Speech Emotion Recognition using PyTorch sponsored by AIS and VISTEC-DEPA AIResearch Institute Thailand.☆21Updated 3 years ago
- NTREX -- News Test References for MT Evaluation☆84Updated last year
- Zero-shot Transfer Learning from English to Arabic☆30Updated 3 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆92Updated 4 months ago
- This repo contains a set of neural transducers, e.g. sequence-to-sequence models, focusing on character-level tasks.☆76Updated last year
- Creating super-parallel corpora of 1500+ unique languages for NLP research☆34Updated 2 years ago
- TUFS Asian Language Parallel Corpus☆50Updated 2 years ago
- Statistics on multilingual datasets☆17Updated 3 years ago
- PyThaiNLP For spaCy☆16Updated 2 years ago
- ☆20Updated 3 years ago
- BERT pre-training in Thai language☆59Updated 6 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 3 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆27Updated 3 years ago
- MFAQ: a Multilingual FAQ Dataset☆17Updated last year
- Tool to fix bitexts and tag near-duplicates for removal☆30Updated 5 months ago
- Parallel Universal Dependencies.☆15Updated last month
- BERT models for many languages created from Wikipedia texts☆33Updated 5 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 3 years ago
- This repository contains the code for the paper "Prompting ELECTRA: Few-Shot Learning with Discriminative Pre-Trained Models".☆48Updated 3 years ago
- Repository for Vajjala & Lucic (2018)☆65Updated last year
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆73Updated last year
- Bilingual term extractor☆54Updated last year
- ☆109Updated last year