jonghwanhyeon / namu-wiki-extractor
A library for extracting plain text from the JSON dump file of Namuwiki
☆25 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for namu-wiki-extractor
- KoBART chatbot ☆47 · Updated 3 years ago
- Parallel dataset of Korean Questions and Commands ☆59 · Updated last year
- Korean disambiguation evaluation dataset ☆46 · Updated last year
- Bias, Hate classification with KoELECTRA 👿 ☆26 · Updated last year
- #Paired Question ☆23 · Updated 4 years ago
- ☆29 · Updated 7 years ago
- Training Transformers of Huggingface with KoNLPy ☆68 · Updated 4 years ago
- KoBART model on Huggingface Transformers ☆63 · Updated 2 years ago
- Introduction to recent natural language processing models ☆75 · Updated 2 years ago
- ☆15 · Updated 2 years ago
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020) ☆116 · Updated 4 years ago
- This repository contains Korean Hate Speech dataset for paper, "K-MHaS: A Multi-label Hate Speech Detection Dataset in Korean Online News… ☆40 · Updated 6 months ago
- Performing downstream tasks using Huggingface ☆64 · Updated 2 years ago
- A project for training Korean language models (Flax, PyTorch with Huggingface Accelerate) ☆30 · Updated last year
- Wikitext format dataset of Namuwiki (Most famous Korean wikipedia) ☆51 · Updated 4 years ago
- BPE-based Korean T5 model for text-to-text unified framework ☆63 · Updated 7 months ago
- A BERT-based reverse dictionary of Korean proverbs ☆96 · Updated last year