copninixh / TH-National-Document-OCR
Tesseract OCR tools for read Thai National Document used TH Sarabun National Font trained and fine-tuned. Read README.md to see about my process.
☆24Updated 2 years ago
Alternatives and similar repositories for TH-National-Document-OCR:
Users that are interested in TH-National-Document-OCR are comparing it to the libraries listed below
- Fix Thai PDF☆33Updated last month
- Pytorch implementation of paper: Thai Nested Named Entity Recognition☆45Updated 5 months ago
- Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo on Huggingface space:☆120Updated 4 months ago
- Benchmark for Thai sentence representation☆113Updated 8 months ago
- WangchanX Fine-tuning Pipeline☆45Updated 5 months ago
- Pretraining transformer based Thai language models☆121Updated last year
- NLP course @ chula 2023☆46Updated 2 years ago
- WangChanGLM 🐘 - The Multilingual Instruction-Following Model☆94Updated last year
- A Python Parser for Thai address☆64Updated 2 years ago
- A Dataset for Thai text summarization from Thairath, ThaiPBS, Prachathai and The Standard with over 350,000 articles. Trained models are …☆43Updated 11 months ago
- Handling Cross- and Out-of-Domain Samples in Thai Word Segmentation (ACL 2021 Findings).☆30Updated last year
- Automated generation of subtitles for tech talks in Thai language using Speechmatics, Gemini, GPT-4o and Claude.☆27Updated 4 months ago
- Thai Law Dataset (Act of Parliament)☆20Updated 3 years ago
- Let's create a tool to convert Thailand budget from PDF to CSV.☆110Updated 11 months ago
- Python Thai Automatic Speech Recognition☆66Updated 2 years ago
- Thai social media text sentiment dataset☆81Updated 4 months ago
- ☆34Updated 10 months ago
- Automated data scraper for Thailand COVID-19 data☆31Updated 2 years ago
- OpenThaiGPT focuses on developing a Thai Chatbot system to have capabilities equivalent to ChatGPT, as well as being able to connect to e…☆114Updated last year
- CodeJourney: Internship Stories รวบรวมบทความฝึกงานสาย Programmer☆29Updated last year
- คลังศัพท์ภาษาไทย☆142Updated last year
- OpenThaiRAG is an open-source Retrieval-Augmented Generation (RAG) framework designed specifically for Thai language processing. This pro…☆36Updated 3 months ago
- Collection of Wongnai's datasets☆76Updated 5 years ago
- ข้อมูลรายชื่อจังหวัด อำเภอ ตำบล รวมไปถึงรหัสไปรษณีย์ และพิกัด latitude longitude ในประเทศไทย☆56Updated 5 years ago
- Dataset for fake news detection in healthcare domain☆12Updated 2 years ago
- NLP course at Chulalongkorn University☆126Updated 6 years ago
- Thai Named Entity Recognition☆54Updated 2 years ago
- Pattern @ chula 2024☆77Updated 11 months ago
- ☆28Updated 11 months ago
- ระบบติดตามและรายงานข้อมูลการระบาดและการฉีดวัคซีน COVID-19 ในประเทศไทย☆112Updated 2 years ago