ko-nlp / Open-korean-corpora
Open Korean NLP Dataset Curation for the Users All Around the Globe
☆141 Updated last year
Related projects
Alternatives and complementary repositories for Open-korean-corpora
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020) ☆116 Updated 4 years ago
- Technical report on standardizing Korean named-entity definitions and labels, and a named-entity morpheme corpus built on it ☆90 Updated 3 years ago
- Korean-English Bilingual Electra Models ☆109 Updated 3 years ago
- Pecab: Pure Python Korean morpheme analyzer based on Mecab ☆156 Updated 6 months ago
- https://ailabs.enliple.com/ ☆105 Updated 3 years ago
- 🦅 Pretrained BigBird Model for Korean (up to 4096 tokens) ☆202 Updated 10 months ago
- Dataset of Korean Threatening Conversations ☆70 Updated 2 years ago
- Finetuning Pipeline ☆90 Updated 2 years ago
- Naver movie review sentiment classification with KoBERT ☆76 Updated last year
- Korean wiki QA dataset for MRC ☆121 Updated 9 months ago
- BERTScore for Korean ☆73 Updated 9 months ago
- Yet another Python binding for mecab-ko ☆80 Updated last year
- List of Korean startups that research and develop natural language processing technology ☆165 Updated 4 years ago
- Distillation of KoBERT from SKTBrain (Lightweight KoBERT) ☆187 Updated last year
- HanBert on 🤗 Huggingface Transformers 🤗 ☆86 Updated 4 years ago
- Parallel dataset of Korean Questions and Commands ☆59 Updated last year
- ☆73 Updated 2 years ago
- Utils for cleaning Sejong corpus data ☆36 Updated 5 years ago
- NER Task with KoBERT (with Naver NLP Challenge dataset) ☆98 Updated last year
- HuggingFace Transformers tutorial using KLUE data ☆129 Updated 3 years ago
- Jiphyeonjeon Season 2 ☆121 Updated 2 years ago
- GPT-2 pretrained on Korean datasets. ☆54 Updated 3 years ago
- KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding ☆299 Updated last year
- Kobart model on Huggingface transformers ☆63 Updated 2 years ago
- Character-level Korean ELECTRA Model (syllable-level Korean ELECTRA) ☆53 Updated last year
- KoRean based SBERT pre-trained models (KR-SBERT) for PyTorch ☆95 Updated 2 years ago
- APEACH: Attacking Pejorative Expressions with Analysis on Crowd-generated Hate Speech Evaluation Datasets ☆75 Updated last year
- BERT with SentencePiece for Korean text ☆70 Updated 4 years ago
- Korean Relation Extraction Gold Standard ☆37 Updated 3 years ago
- Korean Sejong corpus download and simple analysis ☆138 Updated 5 years ago