언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.
☆455Apr 13, 2025Updated 11 months ago
Alternatives and similar repositories for open-korean-instructions
Users that are interested in open-korean-instructions are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☁️ 구름(KULLM): 고려대학교에서 개발한, 한국어에 특화된 LLM☆588May 1, 2024Updated last year
- 한국어 데이터 세트 링크☆910Oct 14, 2024Updated last year
- KoAlpaca: 한국어 명령어를 이해하는 오픈소스 언어모델 (KoAlpaca: An open-source language model to understand Korean instructions)☆1,576Oct 25, 2024Updated last year
- 한국어 언어모델 다분야 사고력 벤치마크☆201Oct 17, 2024Updated last year
- [KO-Platy🥮] Korean-Open-platypus를 활용하여 llama-2-ko를 fine-tuning한 KO-platypus model☆73Aug 24, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- This repo is for Korean wiki table question answering datasets described in the paper of Korean-Specific Dataset for Table Question Answe…☆91Oct 22, 2024Updated last year
- List of Korean pre-trained language models.☆188Aug 31, 2023Updated 2 years ago
- ChatGPT의 RLHF를 학습을 위한 3가지 step별 한국어 데이터셋☆41Nov 21, 2023Updated 2 years ago
- KURE: 고려대학교에서 개발한, 한국어 검색에 특화된 임베딩 모델☆209Feb 26, 2026Updated last month
- huggingface에 있는 한국어 데이터 세트☆36Oct 10, 2024Updated last year
- Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc☆82Feb 28, 2024Updated 2 years ago
- Polyglot: Large Language Models of Well-balanced Competence in Multi-languages☆484Aug 22, 2023Updated 2 years ago
- Awesome list of Korean Large Language Models.☆474Oct 31, 2023Updated 2 years ago
- 한국어 심리 상담 데이터셋☆81Jun 20, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆36Oct 4, 2023Updated 2 years ago
- BERTScore for Korean☆80Feb 22, 2024Updated 2 years ago
- 언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.☆19Jul 16, 2023Updated 2 years ago
- Benchmark in Korean Context☆138Sep 26, 2023Updated 2 years ago
- Korean Multi-task Instruction Tuning☆156Dec 20, 2023Updated 2 years ago
- Korean corpus repository☆747Oct 3, 2022Updated 3 years ago
- ☆123Apr 21, 2023Updated 2 years ago
- ☆106May 8, 2023Updated 2 years ago
- ☆116Feb 25, 2026Updated last month
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- KoLLaVA: Korean Large Language-and-Vision Assistant (feat.LLaVA)☆296Sep 20, 2024Updated last year
- Pretrained ELECTRA Model for Korean☆631Feb 19, 2024Updated 2 years ago
- KSS: Korean String processing Suite☆470Nov 13, 2025Updated 4 months ago
- Yet another python binding for mecab-ko☆88May 16, 2023Updated 2 years ago
- Curation note of NLP datasets☆98Dec 6, 2022Updated 3 years ago
- Official datasets and pytorch implementation repository of SQuARe and KoSBi (ACL 2023)☆249Jun 29, 2023Updated 2 years ago
- ☆443Apr 8, 2022Updated 3 years ago
- ☆69Mar 21, 2024Updated 2 years ago
- 🤗 최소한의 세팅으로 LM을 학습하기 위한 샘플코드☆59May 23, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Pecab: Pure python Korean morpheme analyzer based on Mecab☆172Apr 27, 2024Updated last year
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets☆130Nov 12, 2022Updated 3 years ago
- Official repository for KoMT-Bench built by LG AI Research☆71Aug 8, 2024Updated last year
- Korean Sentence Embedding Repository☆210Dec 1, 2024Updated last year
- StrategyQA 데이터 세트 번역☆23Apr 12, 2024Updated last year
- ☆33Aug 30, 2023Updated 2 years ago
- 자체 구축한 한국어 평가 데이터셋을 이용한 한국어 모델 평가☆31May 31, 2024Updated last year