Thai News Dataset from Thai government website.
☆21Oct 21, 2025Updated 5 months ago
Alternatives and similar repositories for thaigov-v2-corpus
Users that are interested in thaigov-v2-corpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- scripts for cleaning and creating train/validation/test splits for Thai commonvoice☆12Sep 2, 2021Updated 4 years ago
- Thai smart home corpus with "Gowajee" hotword☆18Jul 30, 2023Updated 2 years ago
- It is fine-tune the GPT-Neo model for Thai language.☆12Jun 30, 2021Updated 4 years ago
- Make Pad Thai From few-shot learning 😉☆12Jan 19, 2023Updated 3 years ago
- Domain Adaptation of Thai Word Segmentation Models using Stacked Ensemble (EMNLP2020)☆20Feb 2, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Tesseract OCR tools for read Thai National Document used TH Sarabun National Font trained and fine-tuned. Read README.md to see about my …☆29Dec 5, 2022Updated 3 years ago
- The large thai word2vec☆11Nov 16, 2022Updated 3 years ago
- A Dataset for Thai text summarization from Thairath, ThaiPBS, Prachathai and The Standard with over 350,000 articles. Trained models are …☆49Apr 25, 2024Updated last year
- A Dataset for Thai Text Summarization with over 310K articles.☆29Feb 4, 2023Updated 3 years ago
- OpenThaiGPT focuses on developing a Thai Chatbot system to have capabilities equivalent to ChatGPT, as well as being able to connect to e…☆116May 26, 2023Updated 2 years ago
- Isan NLP☆17Mar 27, 2024Updated last year
- NLP course at Chulalongkorn University 2019☆21Mar 28, 2019Updated 6 years ago
- Thai Named Entity Recognition with BiLSTM-CRF using Word/Character Embedding☆17Oct 27, 2019Updated 6 years ago
- Thai sentence segmentation with conditional random fields☆16Jun 17, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Standalone Dictionary-based, Maximum Matching + Thai Character Cluster (newmm) tokenizer extracted from PyThaiNLP☆13Jan 6, 2022Updated 4 years ago
- Thai Named Entity Recognition☆57Mar 22, 2023Updated 3 years ago
- A public repository for corrupt0 datathon's court data☆11Jul 6, 2019Updated 6 years ago
- ☆14Jun 22, 2020Updated 5 years ago
- Open Thai Wikipedia QA Dataset made by iApp Technology☆14Feb 17, 2021Updated 5 years ago
- NLP stuff with quantum computing☆17Nov 9, 2020Updated 5 years ago
- Pytorch implementation of paper: Thai Nested Named Entity Recognition☆46Feb 27, 2026Updated 3 weeks ago
- Finetune wav2vec2-large-xlsr-53 with Thai Common Voice Corpus 7.0☆53Apr 23, 2022Updated 3 years ago
- English-Thai Machine Translation Models☆29May 3, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- News Article Corpus from Prachathai.com☆17Apr 23, 2021Updated 4 years ago
- ☆45Mar 26, 2021Updated 5 years ago
- Pretraining transformer based Thai language models☆126Nov 6, 2023Updated 2 years ago
- Scrape, clean and explore ThaiME dataset☆12Jul 29, 2020Updated 5 years ago
- ☆14Dec 23, 2024Updated last year
- a Pythonic interface to pull thai mutual fund NAV☆15Dec 15, 2023Updated 2 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Mar 15, 2025Updated last year
- Type-safe CSV and Google Sheets parser for TypeScript and JavaScript☆14Aug 19, 2025Updated 7 months ago
- ☆11Aug 26, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆17Sep 24, 2024Updated last year
- The KlicStudio MCP server is a connector based on the Model Context Protocol (MCP), designed to facilitate interactions with KlicStudio s…☆20Jul 30, 2025Updated 7 months ago
- The implementation of CL-ReLKT (NAACL-2022)☆14Aug 31, 2022Updated 3 years ago
- Simple Telegram bot to annotate and varify automatic speech recognition datasets☆12Mar 30, 2021Updated 4 years ago
- รวมข้อมูล ตำบล อำเภอ และ จังหวัด ในประเทศไทย (77 จังหวัด) อ้างอิงตาม รหัสไปรษณีย์ไทย โดยที่ไม่ใช้ Server side☆34Jun 24, 2025Updated 9 months ago
- Implementation of ConGen: Unsupervised Control and Generalization Distillation For Sentence Representation (Finding of EMNLP 2022).☆22Sep 13, 2023Updated 2 years ago
- Thai_TTS is the project about training "Text to Speech in Thai" using Tacotron2 by NVIDIA.☆34May 24, 2022Updated 3 years ago