Standalone Dictionary-based, Maximum Matching + Thai Character Cluster (newmm) tokenizer extracted from PyThaiNLP
☆13Jan 6, 2022Updated 4 years ago
Alternatives and similar repositories for newmm-tokenizer
Users that are interested in newmm-tokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A public repository for corrupt0 datathon's court data☆11Jul 6, 2019Updated 6 years ago
- ☆14Jun 22, 2020Updated 5 years ago
- scripts for cleaning and creating train/validation/test splits for Thai commonvoice☆12Sep 2, 2021Updated 4 years ago
- Thai smart home corpus with "Gowajee" hotword☆19Jul 30, 2023Updated 2 years ago
- A Dataset for Thai Text Summarization with over 310K articles.☆30Feb 4, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- หนังสือ "Interpretable Machine Learning" โดย Christoph Molnar ฉบับแปลภาษาไทย / Thai translation of "Interpretable Machine Learning" book…☆15Oct 15, 2021Updated 4 years ago
- Thai PDPA Website (Unofficial)☆12Jun 10, 2023Updated 3 years ago
- A companion repository to the "You Only Write Thrice: Creating Documents, Computational Notebooks and Presentations From a Single Source"…☆20Oct 14, 2022Updated 3 years ago
- Make Pad Thai From few-shot learning 😉☆12Jan 19, 2023Updated 3 years ago
- Parallel Universal Dependencies.☆15May 6, 2026Updated last month
- Thai Named Entity Recognition with BiLSTM-CRF using Word/Character Embedding☆17Oct 27, 2019Updated 6 years ago
- Scrape, clean and explore ThaiME dataset☆12Jul 29, 2020Updated 5 years ago
- ☆46Mar 26, 2021Updated 5 years ago
- Thai Named Entity Recognition☆59Mar 22, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Type-safe CSV and Google Sheets parser for TypeScript and JavaScript☆15Updated this week
- ☆40Feb 1, 2023Updated 3 years ago
- Java library to tokenize Thai text into a list of TCCs☆21May 30, 2017Updated 9 years ago
- ☆17May 6, 2022Updated 4 years ago
- Simple Telegram bot to annotate and varify automatic speech recognition datasets☆12Mar 30, 2021Updated 5 years ago
- Thai Word Segmentation and Part-of-Speech Tagging with Deep Learning☆41May 26, 2017Updated 9 years ago
- Tesseract OCR tools for read Thai National Document used TH Sarabun National Font trained and fine-tuned. Read README.md to see about my …☆29Dec 5, 2022Updated 3 years ago
- Dependency parser on Thai language☆27Jan 25, 2025Updated last year
- Thai Spelling Check☆42Apr 2, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- English-Thai Machine Translation Models☆30May 3, 2024Updated 2 years ago
- Python Thai Automatic Speech Recognition☆79Jan 31, 2026Updated 4 months ago
- An initiative for Bangkokians to develop contributable open-source projects to solve local problems!☆38Feb 1, 2023Updated 3 years ago
- AI Software Bill of Materials for EU AI Act☆12Jan 18, 2024Updated 2 years ago
- Explainable AI for Software Engineering: A Hands-on Guide on How to Make Software Analytics More Practical, Explainable, and Actionable (…☆27Nov 14, 2021Updated 4 years ago
- Thai social media text sentiment dataset☆91Nov 7, 2024Updated last year
- A Fast and Accurate Neural Thai Word Segmenter☆96Jan 14, 2025Updated last year
- CRF syllable segmenter for Thai☆27May 3, 2024Updated 2 years ago
- An idea that take advantages of features of deep learning to use unannotated samples for NER and identify sequences with error labels.☆15Feb 4, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A python library / model for creating co-references between AMR graph nodes.☆11Dec 11, 2022Updated 3 years ago
- 🧘 Summon monk to "Blessing" your code☆23Jul 7, 2022Updated 3 years ago
- Tool to collect and review sentences for Common Voice☆83May 10, 2023Updated 3 years ago
- Pytorch implementation of paper: Thai Nested Named Entity Recognition☆47Feb 27, 2026Updated 3 months ago
- A simple tool to generate a binary hex map of Thailand☆13Jan 9, 2024Updated 2 years ago
- Simple bash script that will optimize JPG and PNG images in a directory using jpegoptim, optipng, advpng, and pngcrush.☆11Sep 27, 2013Updated 12 years ago
- A Thai word tokenization library using Deep Neural Network☆428Oct 23, 2020Updated 5 years ago