ko-nlp / Open-korean-corpora
Open Korean NLP Dataset Curation for the Users All Around the Globe
☆146Updated last year
Alternatives and similar repositories for Open-korean-corpora:
Users that are interested in Open-korean-corpora are comparing it to the libraries listed below
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)☆118Updated 4 years ago
- 한국어 개체명 정의 및 표지 표준화 기술보고서와 이를 기반으로 제작된 개체명 형태소 말뭉치☆91Updated 4 years ago
- Pecab: Pure python Korean morpheme analyzer based on Mecab☆162Updated 11 months ago
- Parallel dataset of Korean Questions and Commands☆60Updated 2 years ago
- A BERT-based reverse dictionary of Korean proverbs☆96Updated 2 years ago
- https://ailabs.enliple.com/☆105Updated 4 years ago
- 🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)☆203Updated last year
- Korean wiki QA dataset for MRC☆121Updated last year
- 국내 자연어 처리 기술을 연구 및 개발하는 스타트업 목록☆166Updated 4 years ago
- Jiphyeonjeon Season 2☆121Updated 2 years ago
- KoRean based SBERT pre-trained models (KR-SBERT) for PyTorch☆97Updated 2 years ago
- Korean-English Bilingual Electra Models☆109Updated 3 years ago
- Korean sejong corpus download and simple analysis☆141Updated 5 years ago
- Korean Online That-gul Emotions Dataset☆120Updated last year
- APEACH: Attacking Pejorative Expressions with Analysis on Crowd-generated Hate Speech Evaluation Datasets☆76Updated 2 years ago
- Split Korean text into sentences using heuristic algorithm.☆213Updated 4 years ago
- Dataset of Korean Threatening Conversations☆71Updated 2 years ago
- BERT with SentencePiece for Korean text☆72Updated 5 years ago
- GPT-2 pretrained on Korean datasets.☆54Updated 3 years ago
- Korean Relation Extraction Gold Standard☆35Updated 3 years ago
- KLUE 데이터를 활용한 HuggingFace Transformers 튜토리얼☆129Updated 3 years ago
- 자연어 처리와 관련한 여러 튜토리얼 저장소☆79Updated 4 years ago
- Finetuning Pipeline☆90Updated 3 years ago
- ☆73Updated 3 years ago
- ☆195Updated last year
- HanBert on 🤗 Huggingface Transformers 🤗☆87Updated 4 years ago
- KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding☆304Updated last year
- Yet another python binding for mecab-ko☆85Updated last year
- Distillation of KoBERT from SKTBrain (Lightweight KoBERT)☆193Updated last year
- NER Task with KoBERT (with Naver NLP Challenge dataset)☆99Updated last year