ko-nlp / Open-korean-corporaView external linksLinks
Open Korean NLP Dataset Curation for the Users All Around the Globe
☆152Nov 18, 2023Updated 2 years ago
Alternatives and similar repositories for Open-korean-corpora
Users that are interested in Open-korean-corpora are comparing it to the libraries listed below
Sorting:
- Korean corpus repository☆737Oct 3, 2022Updated 3 years ago
- KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding☆309Jul 9, 2023Updated 2 years ago
- Training Transformers of Huggingface with KoNLPy☆68Aug 28, 2020Updated 5 years ago
- 모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.☆11Mar 2, 2022Updated 3 years ago
- ☆25Oct 28, 2020Updated 5 years ago
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)☆119Oct 8, 2020Updated 5 years ago
- NLP Shared tasks (NER, SRL) using NSML☆182Jan 3, 2019Updated 7 years ago
- 🤗 Pretrained BERT model & WordPiece tokenizer trained on Korean Comments 한국어 댓글로 프리트레이닝한 BERT 모델과 데이터셋☆495Nov 7, 2022Updated 3 years ago
- https://ailabs.enliple.com/☆105Feb 25, 2021Updated 4 years ago
- 한국어 데이터 세트 링크☆900Oct 14, 2024Updated last year
- Structured argument extraction for Korean☆22Feb 17, 2022Updated 3 years ago
- 문장단위로 분절된 한국어 위키피디아 코퍼스. Releases에서 다운로드 받거나 tfds-korean으로 사용해주세요.☆24Sep 6, 2023Updated 2 years ago
- Parallel dataset of Korean Questions and Commands☆60Mar 24, 2023Updated 2 years ago
- Pretrained ELECTRA Model for Korean☆630Feb 19, 2024Updated last year
- Adversarial Test Dataset for Korean Multi-turn Response Selection☆34Dec 16, 2021Updated 4 years ago
- #Paired Question☆24Jun 16, 2020Updated 5 years ago
- Bias, Hate classification with KoELECTRA 👿☆27Jun 12, 2023Updated 2 years ago
- Large scale unannotated Korean corpus for unsupervised tasks. (e.g. Language modeling)☆28Aug 11, 2019Updated 6 years ago
- Sentence Embeddings using Siamese ETRI KoBERT☆163Aug 16, 2025Updated 6 months ago
- ☆197May 22, 2023Updated 2 years ago
- KoParadigm: Korean Inflectional Paradigm Generator☆57Nov 23, 2022Updated 3 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆11May 27, 2022Updated 3 years ago
- ☆92Mar 3, 2022Updated 3 years ago
- This repo is for Korean wiki table question answering datasets described in the paper of Korean-Specific Dataset for Table Question Answe…☆91Oct 22, 2024Updated last year
- MeCab model trained with OpenKorPos.☆23Jun 19, 2022Updated 3 years ago
- KB국민은행에서 제공하는 경제/금융 도메인에 특화된 한국어 ALBERT 모델☆241Oct 7, 2021Updated 4 years ago
- GPT-2 pretrained on Korean datasets.☆54Oct 12, 2021Updated 4 years ago
- Korean BART☆464Jun 14, 2025Updated 8 months ago
- Kobart model on Huggingface transformers☆64Feb 15, 2022Updated 4 years ago
- KoRean based BERT pre-trained models (KR-BERT) for Tensorflow and PyTorch☆210Apr 24, 2024Updated last year
- Korean large emotion labeled dataset (EmoNSMC)☆14Mar 5, 2020Updated 5 years ago
- 2019 국어경진대회 한국어 의존구문 분석 대상(문체부 장관상)☆15Oct 26, 2022Updated 3 years ago
- Korean Nested Named Entity Corpus☆20May 13, 2023Updated 2 years ago
- [HCLT 2022] Korean sentence text similarity dataset using naver shopping review☆25Oct 20, 2022Updated 3 years ago
- 한국어 악성댓글 데이터셋☆73Sep 26, 2020Updated 5 years ago
- Finetuning Pipeline☆89Feb 25, 2022Updated 3 years ago
- Polyglot: Large Language Models of Well-balanced Competence in Multi-languages☆484Aug 22, 2023Updated 2 years ago
- Curation note of NLP datasets☆98Dec 6, 2022Updated 3 years ago
- Korean Visual Question Answering☆59Feb 18, 2020Updated 5 years ago