parinzee / seed-free-synthetic-instruct
Official Code for "Seed-Free Synthetic Data Generation Framework for Instruction-Tuning LLMs: A Case Study in Thai"
☆21 · Updated 2 months ago
Alternatives and similar repositories for seed-free-synthetic-instruct:
Users who are interested in seed-free-synthetic-instruct are comparing it to the repositories listed below.
- WangchanX Fine-tuning Pipeline ☆44 · Updated 3 months ago
- Benchmark for Thai sentence representation ☆106 · Updated 6 months ago
- ☆24 · Updated last year
- OpenThaiRAG is an open-source Retrieval-Augmented Generation (RAG) framework designed specifically for Thai language processing. This pro… ☆33 · Updated last month
- WangChanGLM 🐘 - The Multilingual Instruction-Following Model ☆94 · Updated last year
- ☆28 · Updated 9 months ago
- ☆21 · Updated 8 months ago
- Speech Emotion Recognition using PyTorch sponsored by AIS and VISTEC-DEPA AIResearch Institute Thailand. ☆18 · Updated 3 years ago
- Tesseract OCR tools for reading Thai national documents that use the TH Sarabun National Font, trained and fine-tuned. Read README.md to see about my … ☆19 · Updated 2 years ago
- Fix Thai PDF ☆32 · Updated last week
- NLP course @ chula 2023 ☆46 · Updated last year
- OpenThaiGPT focuses on developing a Thai Chatbot system to have capabilities equivalent to ChatGPT, as well as being able to connect to e… ☆112 · Updated last year
- PyTorch implementation of the paper "Thai Nested Named Entity Recognition" ☆43 · Updated 3 months ago
- Pretraining transformer based Thai language models ☆121 · Updated last year
- ☆24 · Updated 3 months ago
- Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo on Huggingface space: ☆108 · Updated 2 months ago
- Handling Cross- and Out-of-Domain Samples in Thai Word Segmentation (ACL 2021 Findings). ☆30 · Updated 11 months ago
- ☆34 · Updated 8 months ago
- Implementation of ConGen: Unsupervised Control and Generalization Distillation For Sentence Representation (Findings of EMNLP 2022). ☆21 · Updated last year
- Python Thai Automatic Speech Recognition ☆64 · Updated last year
- ☆22 · Updated 3 months ago
- Simple chatbot for Siriraj SIP. This is the demo repository for Mahidol's hackathon ☆12 · Updated 10 months ago
- ☆10 · Updated 2 weeks ago
- ☆40 · Updated 3 months ago
- ☆11 · Updated last year
- 🧰 The AutoTokenizer that TikToken always needed -- Load any tokenizer with TikToken now! ✨ ☆35 · Updated 3 weeks ago
- ☆12 · Updated 4 months ago
- A Dataset for Thai text summarization from Thairath, ThaiPBS, Prachathai and The Standard with over 350,000 articles. Trained models are … ☆41 · Updated 9 months ago
- ☆11 · Updated last year