ko-nlp / Open-korean-corpora
Open Korean NLP Dataset Curation for the Users All Around the Globe
☆145 Updated last year
Alternatives and similar repositories for Open-korean-corpora:
Users who are interested in Open-korean-corpora are comparing it to the libraries listed below.
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020) ☆118 Updated 4 years ago
- Technical report on standardizing Korean named entity definitions and labels, and a named entity morpheme corpus built on it ☆90 Updated 4 years ago
- Pecab: Pure Python Korean morpheme analyzer based on Mecab ☆161 Updated 11 months ago
- Korean-English Bilingual Electra Models ☆109 Updated 3 years ago
- 🦅 Pretrained BigBird Model for Korean (up to 4096 tokens) ☆203 Updated last year
- KoRean based SBERT pre-trained models (KR-SBERT) for PyTorch ☆96 Updated 2 years ago
- Utils for cleaning Sejong corpus data ☆36 Updated 5 years ago
- Jiphyeonjeon Season 2 ☆121 Updated 2 years ago
- ☆73 Updated 3 years ago
- https://ailabs.enliple.com/ ☆105 Updated 4 years ago
- List of startups in Korea researching and developing natural language processing technology ☆166 Updated 4 years ago
- Distillation of KoBERT from SKTBrain (Lightweight KoBERT) ☆189 Updated last year
- KoRean based BERT pre-trained models (KR-BERT) for Tensorflow and PyTorch ☆207 Updated 11 months ago
- BERTScore for Korean ☆76 Updated last year
- Parallel dataset of Korean Questions and Commands ☆60 Updated 2 years ago
- NER Task with KoBERT (with Naver NLP Challenge dataset) ☆99 Updated last year
- ☆196 Updated last year
- Korean wiki QA dataset for MRC ☆121 Updated last year
- Dataset of Korean Threatening Conversations ☆70 Updated 2 years ago
- Yet another Python binding for mecab-ko ☆85 Updated last year
- Korean Sejong corpus download and simple analysis ☆141 Updated 5 years ago
- KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding ☆303 Updated last year
- Korean Online That-gul Emotions Dataset ☆121 Updated last year
- HanBert on 🤗 Huggingface Transformers 🤗 ☆87 Updated 4 years ago
- Korean Relation Extraction Gold Standard ☆36 Updated 3 years ago
- APEACH: Attacking Pejorative Expressions with Analysis on Crowd-generated Hate Speech Evaluation Datasets ☆75 Updated 2 years ago
- GPT-2 pretrained on Korean datasets. ☆54 Updated 3 years ago
- ☆58 Updated last year
- A BERT-based reverse dictionary of Korean proverbs ☆96 Updated 2 years ago
- HuggingFace Transformers tutorial using KLUE data ☆129 Updated 3 years ago