KOLD: Korean Offensive Language Dataset
☆81Nov 13, 2022Updated 3 years ago
Alternatives and similar repositories for KOLD
Users who are interested in KOLD are comparing it to the repositories listed below
- "Why do I feel offended?" - Korean Dataset for Offensive Language Identification (EACL2023 Findings)☆15May 14, 2023Updated 2 years ago
- Korean Named Entity Corpus☆25May 12, 2023Updated 2 years ago
- ☆19Oct 24, 2023Updated 2 years ago
- Hate speech detection corpus in Korean, shared with EMNLP 2023 paper☆17Apr 19, 2024Updated last year
- Provides functionality for converting Modu Corpus (모두의 말뭉치) data into a form convenient for analysis.☆11Mar 2, 2022Updated 4 years ago
- Beyond LM: How can language model go forward in the future?☆15Apr 30, 2023Updated 2 years ago
- This repository contains Korean Hate Speech dataset for paper, "K-MHaS: A Multi-label Hate Speech Detection Dataset in Korean Online News…☆50May 11, 2024Updated last year
- Korean large emotion labeled dataset (EmoNSMC)☆14Mar 5, 2020Updated 5 years ago
- [HCLT 2022] Korean sentence text similarity dataset using naver shopping review☆25Oct 20, 2022Updated 3 years ago
- Training Transformers of Huggingface with KoNLPy☆68Aug 28, 2020Updated 5 years ago
- Bias, Hate classification with KoELECTRA 👿☆27Jun 12, 2023Updated 2 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 4 months ago
- Human Rights Corpus (#인권코퍼스)☆31Oct 6, 2023Updated 2 years ago
- CareCall for Seniors: Role Specified Open-Domain Dialogue dataset generated by leveraging LLMs (NAACL 2022).☆60May 3, 2022Updated 3 years ago
- First-place solution for the Modu Corpus (모두의 말뭉치) AI language proficiency evaluation.☆49Nov 20, 2021Updated 4 years ago
- Official datasets and pytorch implementation repository of SQuARe and KoSBi (ACL 2023)☆249Jun 29, 2023Updated 2 years ago
- Introduction to recent natural language processing models☆74Jul 22, 2022Updated 3 years ago
- Dataset of Korean Threatening Conversations☆72Nov 1, 2022Updated 3 years ago
- Korean Movie Review Emotion (KMRE) Dataset☆21Sep 7, 2020Updated 5 years ago
- baikal.ai's pre-trained BERT models: descriptions and sample codes☆12Jun 24, 2021Updated 4 years ago
- ☆442Apr 8, 2022Updated 3 years ago
- A collection of public Korean instruction datasets for training language models.☆19Jul 16, 2023Updated 2 years ago
- Korean Nested Named Entity Corpus☆20May 13, 2023Updated 2 years ago
- ☆197May 22, 2023Updated 2 years ago
- BERTScore for Korean☆80Feb 22, 2024Updated 2 years ago
- CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean☆48Dec 23, 2024Updated last year
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆28Dec 9, 2022Updated 3 years ago
- ☆59Jan 2, 2024Updated 2 years ago
- APEACH: Attacking Pejorative Expressions with Analysis on Crowd-generated Hate Speech Evaluation Datasets☆77Feb 5, 2023Updated 3 years ago
- HuggingFace Transformers tutorial using KLUE data☆129Jun 28, 2021Updated 4 years ago
- Korean HateSpeech Dataset☆394Jul 18, 2020Updated 5 years ago
- 이동호, 이정훈, 김유리, 김형준, 박승면, 양유준, 신웅비 (Dong Ho Lee, Jung Hoon Lee, Yu Ri Kim, Hyung Jun Kim, Seung Myun Park, Yu Jun Yang, Woong Bi Shin)☆15Apr 16, 2020Updated 5 years ago
- This repo is for Korean wiki table question answering datasets described in the paper of Korean-Specific Dataset for Table Question Answe…☆91Oct 22, 2024Updated last year
- Pecab: Pure python Korean morpheme analyzer based on Mecab☆172Apr 27, 2024Updated last year
- MeCab model trained with OpenKorPos.☆23Jun 19, 2022Updated 3 years ago
- Benchmark in Korean Context☆138Sep 26, 2023Updated 2 years ago
- ELECTRA-based language model for conversational Korean☆53Aug 4, 2021Updated 4 years ago
- ☆44Jul 5, 2024Updated last year
- Google's Conceptual Captions Dataset translated into Korean☆23Aug 28, 2022Updated 3 years ago