MrBananaHuman/PangyoCorpora

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MrBananaHuman/PangyoCorpora)

MrBananaHuman / PangyoCorpora

☆38

Alternatives and similar repositories for PangyoCorpora

Users that are interested in PangyoCorpora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jason9693 / polyglot-finetuning-oslo
View on GitHub
☆19Sep 20, 2022Updated 3 years ago
MrBananaHuman / open-korean-instructions
View on GitHub
언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.
☆19Jul 16, 2023Updated 3 years ago
human-rights-corpus / HRC
View on GitHub
#인권코퍼스
☆31Oct 6, 2023Updated 2 years ago
nlpai-lab / KommonGen
View on GitHub
한국어 생성 모델의 상식 추론을 위한 KommonGen 데이터셋입니다.
☆21Oct 5, 2021Updated 4 years ago
openkorpos / model-mecab
View on GitHub
MeCab model trained with OpenKorPos.
☆23Jun 19, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
BM-K / KoDiffCSE
View on GitHub
Difference-based Contrastive Learning for Korean Sentence Embeddings
☆23Mar 11, 2026Updated 4 months ago
korean-named-entity / konne
View on GitHub
Korean Nested Named Entity Corpus
☆20May 13, 2023Updated 3 years ago
ko-nlp / moducorpus-sanitizer
View on GitHub
모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.
☆11Mar 2, 2022Updated 4 years ago
hyunwoongko / beyond-lm
View on GitHub
Beyond LM: How can language model go forward in the future?
☆15Apr 30, 2023Updated 3 years ago
korean-named-entity / konec
View on GitHub
Korean Named Entity Corpus
☆25May 12, 2023Updated 3 years ago
Data-Intelligence-Lab / DEFT-korean-alpaca
View on GitHub
☆23Oct 30, 2023Updated 2 years ago
formidable-stella / ShareGPT-translation
View on GitHub
☆21May 24, 2023Updated 3 years ago
QuoQA-NLP / Ko-conceptual-captions
View on GitHub
Google's Conceptual Captions Dataset translated into Korean
☆23Aug 28, 2022Updated 3 years ago
J-Seo / KommonGen
View on GitHub
한국어 생성 모델의 상식 추론을 위한 KommonGen 데이터셋입니다.
☆17Oct 5, 2021Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
YongWookHa / kor-text-preprocess
View on GitHub
Korean text data preprocess toolkit for NLP
☆18Jun 11, 2019Updated 7 years ago
EleutherAI / hae-rae
View on GitHub
☆33Aug 30, 2023Updated 2 years ago
baikalai / baikal-bert
View on GitHub
baikal.ai's pre-trained BERT models: descriptions and sample codes
☆12Jun 24, 2021Updated 5 years ago
krafton-ai / KORani
View on GitHub
☆108May 8, 2023Updated 3 years ago
Beomi / easy-lm-trainer
View on GitHub
🤗 최소한의 세팅으로 LM을 학습하기 위한 샘플코드
☆59May 23, 2023Updated 3 years ago
Atipico1 / Kor-IR
View on GitHub
Kor-IR: Korean Information Retrieval Benchmark
☆87Jul 3, 2024Updated 2 years ago
Seondong / LocEmb
View on GitHub
LocEmb: Location Embedding (Currently covering districts, roads, and businesses in Korea)
☆11Aug 15, 2022Updated 3 years ago
koalanlp / python-support
View on GitHub
Python wrapper for KoalaNLP (Korean NLP with Java/Scala)
☆31Jan 20, 2026Updated 6 months ago
HeegyuKim / open-korean-instructions
View on GitHub
언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.
☆469Apr 13, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
MrBananaHuman / UnethicalQuestionsKor
View on GitHub
☆19Oct 24, 2023Updated 2 years ago
tunib-ai / artwork_captions
View on GitHub
Machine Generated Captions for Best Artworks
☆22Sep 21, 2022Updated 3 years ago
teddysum / Korean_SC_2023
View on GitHub
☆10Oct 28, 2024Updated last year
nlpai-lab / KURE
View on GitHub
KURE: 고려대학교에서 개발한, 한국어 검색에 특화된 임베딩 모델
☆225Apr 14, 2026Updated 3 months ago
nlpai-lab / Korean-CommonGen
View on GitHub
[Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation
☆11May 27, 2022Updated 4 years ago
smilegate-ai / OPELA
View on GitHub
☆29Nov 23, 2022Updated 3 years ago
sionic-ai / Data_KoSuperNI
View on GitHub
StrategyQA 데이터 세트 번역
☆22Apr 12, 2024Updated 2 years ago
HeegyuKim / ko-rm-judge
View on GitHub
Reward Model을 이용하여 언어모델의 답변을 평가하기
☆30Feb 23, 2024Updated 2 years ago
jason9693 / oslo-kogpt-finetunig
View on GitHub
kogpt를 oslo로 파인튜닝하는 예제.
☆23Aug 26, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
HAE-RAE / HAE-RAE-BENCH
View on GitHub
Benchmark in Korean Context
☆139Sep 26, 2023Updated 2 years ago
AIRC-KETI / Korean-Copora
View on GitHub
☆14Dec 9, 2021Updated 4 years ago
korean-named-entity / konne-prep
View on GitHub
☆19Jan 29, 2023Updated 3 years ago
daje0601 / CoT-Reasoning_without_Prompting
View on GitHub
구글에서 발표한 Chain-of-Thought Reasoning without Prompting을 코드로 구현한 레포입니다.
☆65Sep 28, 2024Updated last year
tabtoyou / KoLLaVA
View on GitHub
KoLLaVA: Korean Large Language-and-Vision Assistant (feat.LLaVA)
☆295Sep 20, 2024Updated last year
EleutherAI / dps
View on GitHub
Data processing system for polyglot
☆93Jul 6, 2026Updated 3 weeks ago
teddysum / korean_evaluation
View on GitHub
☆10Jun 5, 2025Updated last year