songys / huggingface_KoreanDataset
Korean datasets available on Hugging Face
☆35 · Oct 10, 2024 · Updated last year
Alternatives and similar repositories for huggingface_KoreanDataset
Users that are interested in huggingface_KoreanDataset are comparing it to the libraries listed below
- ☆20 · Jul 24, 2024 · Updated last year
- A collection of public Korean instruction datasets for training language models. ☆452 · Apr 13, 2025 · Updated 10 months ago
- #HumanRightsCorpus ☆31 · Oct 6, 2023 · Updated 2 years ago
- ☆19 · Oct 24, 2023 · Updated 2 years ago
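- KURE: an embedding model specialized for Korean retrieval, developed at Korea University ☆206 · Sep 10, 2025 · Updated 5 months ago
- A multi-domain reasoning benchmark for Korean language models ☆201 · Oct 17, 2024 · Updated last year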
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models ☆25 · Aug 24, 2024 · Updated last year
- BPE-based Korean T5 model for a unified text-to-text framework ☆63 · Apr 17, 2024 · Updated last year
- Kor-IR: Korean Information Retrieval Benchmark ☆87 · Jul 3, 2024 · Updated last year
- Code implementation that prevents LLMs from generating foreign-language tokens ☆83 · Aug 7, 2025 · Updated 6 months ago
- CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean ☆47 · Dec 23, 2024 · Updated last year
- This repo is for Korean wiki table question answering datasets described in the paper of Korean-Specific Dataset for Table Question Answe… ☆91 · Oct 22, 2024 · Updated last year
- Evaluation of Korean models using self-built Korean evaluation datasets ☆31 · May 31, 2024 · Updated last year
- Korean Commonsense Knowledge Graph ☆15 · Dec 23, 2022 · Updated 3 years ago
- ☆33 · Aug 30, 2023 · Updated 2 years ago
- Introduction to the latest natural language processing models ☆74 · Jul 22, 2022 · Updated 3 years ago
- Korean psychological counseling dataset ☆81 · Jun 20, 2023 · Updated 2 years ago
- KOLD: Korean Offensive Language Dataset ☆81 · Nov 13, 2022 · Updated 3 years ago
- Evaluating Multimodal Generative AI with Korean Educational Standards, NAACL 2025. ☆24 · May 15, 2025 · Updated 9 months ago
- MeCab model trained with OpenKorPos. ☆23 · Jun 19, 2022 · Updated 3 years ago
- ☆123 · Apr 21, 2023 · Updated 2 years ago
- Difference-based Contrastive Learning for Korean Sentence Embeddings ☆23 · Updated this week
- Google's Conceptual Captions Dataset translated into Korean ☆23 · Aug 28, 2022 · Updated 3 years ago
- The most modern LLM evaluation toolkit ☆70 · Nov 9, 2025 · Updated 3 months ago
- [HCLT 2022] Korean sentence text similarity dataset using Naver shopping reviews ☆25 · Oct 20, 2022 · Updated 3 years ago
- ☆10 · Jan 20, 2024 · Updated 2 years ago
- [COLING 2024] SentiCSE: A Sentiment-aware Contrastive Sentence Embedding Framework with Sentiment-guided Textual Similarity ☆13 · May 8, 2024 · Updated last year
- BERT score for text generation ☆12 · Jan 15, 2025 · Updated last year
- ☆11 · Aug 9, 2022 · Updated 3 years ago
- ☆10 · Jun 5, 2025 · Updated 8 months ago
- For the RLHF learning environment of Koreans ☆25 · Sep 25, 2023 · Updated 2 years ago
- Korean Speech to English Translation Corpus ☆45 · Sep 3, 2021 · Updated 4 years ago
- Korean datasets for each of the three steps of training ChatGPT's RLHF ☆40 · Nov 21, 2023 · Updated 2 years ago
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder. ☆29 · Dec 2, 2025 · Updated 2 months ago
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators" ☆12 · Mar 25, 2025 · Updated 10 months ago
- [Suspended] Modern, customizable AI character frontend for enthusiasts (inspired by SillyTavern) ☆10 · Nov 8, 2024 · Updated last year
- ☆10 · Oct 28, 2024 · Updated last year
- Liner LLM Meetup archive ☆71 · Mar 27, 2024 · Updated last year
- Open-WikiTable: Dataset for Open Domain Question Answering with Complex Reasoning over Table ☆27 · Jun 2, 2023 · Updated 2 years ago