gyunggyung/LLM-Ko-Datasets



🇰🇷 Korean LLM Datasets | Pre-training, SFT, DPO, RLHF, CoT | Korean LLM dataset curation

☆41

Alternatives and similar repositories for LLM-Ko-Datasets

Users who are interested in LLM-Ko-Datasets are comparing it to the repositories listed below.


sionic-ai / pycon-2024-tutorial
2024 PyCon Korea tutorial
☆12 · Nov 8, 2024 · Updated last year
hmmhmmhm / typescript-json
📦 Initialize JSON data according to type schema
☆10 · May 10, 2021 · Updated 5 years ago
gyunggyung / LiOnConnect
"Learning-based One-line intelligence Owner Network Connectivity Tool"
☆15 · Apr 19, 2023 · Updated 3 years ago
hmmhmmhm / curiosity
Curiosity about consciousness
☆11 · May 6, 2023 · Updated 3 years ago
cardy20 / KODOLI
"Why do I feel offended?" - Korean Dataset for Offensive Language Identification (EACL2023 Findings)
☆15 · May 14, 2023 · Updated 3 years ago
SOMJANG / Youtube_Comment_Crawler
YouTube comment crawler (Python, BeautifulSoup, Selenium)
☆35 · Sep 13, 2022 · Updated 3 years ago
blacktoast / super-gen
VS Code extension
☆13 · Jan 13, 2023 · Updated 3 years ago
kyopark2014 / llm-agent
Shows how to deploy and use an agent with an LLM.
☆19 · Mar 1, 2025 · Updated last year
chalkpe / Cesium
Node.js powered chatting server
☆10 · Oct 19, 2019 · Updated 6 years ago
psymon-ai / KoLlama2
☆12 · Aug 19, 2023 · Updated 2 years ago
MrBananaHuman / KoGPT2ForParaphrasing
TEMP
☆34 · Apr 2, 2020 · Updated 6 years ago
seongyeon1 / oh-my-slides
A Claude Code plugin that generates animation-rich HTML presentations from natural language prompts. 20 curated design presets, PPTX expo…
☆16 · May 15, 2026 · Updated 2 months ago
moon1ite / koco
Easy installer of kocohub dataset
☆24 · May 31, 2020 · Updated 6 years ago
SungjoonPark / DeepNLP2
Deep NLP 2 (2019.3-5)
☆10 · Feb 19, 2019 · Updated 7 years ago
ksmin23 / my-adk-python-samples
A collection of Python agent samples built with the Google Agent Development Kit (ADK), demonstrating integrations with services like B…
☆21 · May 8, 2026 · Updated 2 months ago
DunZhang / Jasper-Token-Compression-Training
Training code for Jasper-Token-Compression-600M
☆20 · Nov 19, 2025 · Updated 8 months ago
marpple / FxSVG
Functional SVG Handling Library
☆18 · Dec 12, 2022 · Updated 3 years ago
hong-seongmin / techGiterview
☆35 · Mar 22, 2026 · Updated 3 months ago
samchon / nestia-start
Nestia template project installed by "npx nestia start"
☆18 · Updated this week
Marker-Inc-Korea / AutoRAG-example-korean-embedding-benchmark
AutoRAG example about benchmarking Korean embeddings.
☆45 · Oct 2, 2024 · Updated last year
workdd / LLM_Foreign_Block
Code implementation that prevents LLMs from generating foreign-language tokens
☆87 · Aug 7, 2025 · Updated 11 months ago
ro-ko / Awesome-SLM
Awesome-SLM: a curated list of Small Language Models
☆33 · Jun 24, 2024 · Updated 2 years ago
teddylee777 / react-voice-agent
☆12 · Oct 3, 2024 · Updated last year
HoyunS / MentalBench
☆22 · May 19, 2026 · Updated 2 months ago
LCS2-IIITD / quarc-counterspeech
[ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati…
☆10 · Sep 23, 2023 · Updated 2 years ago
pinion05 / codex-claudecode-proxy
☆20 · Mar 6, 2026 · Updated 4 months ago
boychaboy / KOLD
KOLD: Korean Offensive Language Dataset
☆83 · Nov 13, 2022 · Updated 3 years ago
jiwoochris / In-Memory-Vector-DB
Construct a vector database from sentence embeddings, and have your LLM respond based on this database.
☆10 · Feb 5, 2024 · Updated 2 years ago
Gubuzeong / Getting-Started-with-Google-BERT
☆15 · Mar 28, 2022 · Updated 4 years ago
seokgukim / MSWSeokguKimsPackages
Bunch of Maplestory World frameworks I made
☆12 · Feb 5, 2023 · Updated 3 years ago
daje0601 / AllinOne_LLM
Covers everything from the history of NLP to serving in a single book.
☆27 · Dec 6, 2025 · Updated 7 months ago
gyunggyung / docling-translate
Advanced PDF/Document Translator with interactive comparison. Built on IBM Docling.
☆18 · Jan 5, 2026 · Updated 6 months ago
MrAndrewBlood / Captcha-Solver
Solver with Interface window for Cloudflare Turnstile and other Captchas.
☆14 · Oct 7, 2024 · Updated last year
sionic-ai / serverless-rag-mcp-server
☆40 · Mar 11, 2025 · Updated last year
MLP-Lab / Bllossom
☆85 · May 8, 2024 · Updated 2 years ago
Steven-A3 / HWPX-CLAUDE-SKILL
☆20 · Jun 16, 2026 · Updated last month
braincrew-lab / Log26_n_Connect2026
Shares the presentation materials from Connect 2026 & Log26.
☆97 · Jan 16, 2026 · Updated 6 months ago
HP-DEVGRU / UnrealSteel
Makes a 3D-modeled character imitate the user's motion in real time using Unreal Engine, just like REAL STEEL
☆20 · Nov 4, 2023 · Updated 2 years ago
LangChain-OpenTutorial / langchain-opentutorial-pypi
langchain opentutorial utility package for Open Tutorial
☆10 · Feb 2, 2025 · Updated last year