keyreply / Thai-NLP-DatasetLinks
More than 43+ collections of Thai Natural Language Processing libraries. Update daily.
☆31Updated 7 years ago
Alternatives and similar repositories for Thai-NLP-Dataset
Users that are interested in Thai-NLP-Dataset are comparing it to the libraries listed below
Sorting:
- ☆44Updated 4 years ago
- Pretraining transformer based Thai language models☆123Updated 2 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆22Updated 2 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 3 years ago
- TUFS Asian Language Parallel Corpus☆52Updated 2 years ago
- Implementation of ConGen: Unsupervised Control and Generalization Distillation For Sentence Representation (Finding of EMNLP 2022).☆22Updated 2 years ago
- BERT pre-training in Thai language☆59Updated 7 years ago
- ☆15Updated 7 months ago
- NTREX -- News Test References for MT Evaluation☆86Updated last year
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆105Updated 3 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆99Updated 9 months ago
- A Fast and Accurate Neural Thai Word Segmenter☆93Updated 11 months ago
- English-Thai Machine Translation with OPUS data☆19Updated 5 years ago
- Thai Named Entity Recognition☆56Updated 2 years ago
- Speech Emotion Recognition using PyTorch sponsored by AIS and VISTEC-DEPA AIResearch Institute Thailand.☆22Updated 4 years ago
- ☆116Updated 2 months ago
- Zero-shot Transfer Learning from English to Arabic☆30Updated 3 years ago
- ☆50Updated last year
- PyThaiNLP For spaCy☆16Updated 3 years ago
- A Dataset for Thai Text Summarization with over 310K articles.☆29Updated 2 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Updated 4 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆162Updated last year
- Pytorch implementation of paper: Thai Nested Named Entity Recognition☆46Updated last year
- ☆12Updated 5 years ago
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆34Updated 3 years ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆76Updated 2 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆107Updated last year
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆74Updated last year
- Repository for Vajjala & Lucic (2018)☆67Updated last year
- This repository contains the Arabic sarcasm dataset (ArSarcasm)☆25Updated 4 years ago