jonghwanhyeon / namu-wiki-extractorLinks
A library to extract plaintexts from the JSON dump file of namu wiki
☆27Updated 3 years ago
Alternatives and similar repositories for namu-wiki-extractor
Users that are interested in namu-wiki-extractor are comparing it to the libraries listed below
Sorting:
- Parallel dataset of Korean Questions and Commands☆61Updated 2 years ago
- This repository contains Korean Hate Speech dataset for paper, "K-MHaS: A Multi-label Hate Speech Detection Dataset in Korean Online News…☆49Updated last year
- KoBART chatbot☆47Updated 4 years ago
- Dataset of Korean Threatening Conversations☆73Updated 3 years ago
- APEACH: Attacking Pejorative Expressions with Analysis on Crowd-generated Hate Speech Evaluation Datasets☆77Updated 2 years ago
- #Paired Question☆24Updated 5 years ago
- BERTScore for Korean☆81Updated last year
- Korean Online That-gul Emotions Dataset☆129Updated 2 years ago
- Training Transformers of Huggingface with KoNLPy☆68Updated 5 years ago
- bpe based korean t5 model for text-to-text unified framework☆63Updated last year
- Kobart model on Huggingface transformers☆64Updated 3 years ago
- Yet another python binding for mecab-ko☆88Updated 2 years ago
- 한국어 높임말 교정☆26Updated 2 years ago
- ☆76Updated 3 years ago
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)☆120Updated 5 years ago
- Wikitext format dataset of Namuwiki (Most famous Korean wikipedia)☆52Updated 5 years ago
- 🤗 최소한의 세팅으로 LM을 학습하기 위한 샘플코드☆58Updated 2 years ago
- ☆21Updated 3 years ago
- huggingface를 이용하여 downstream task 수행하기☆64Updated 3 years ago
- 특허분야 특화된 한국어 AI언어모델 KorPatBERT☆66Updated last year
- 한국어 중의성 해소 평가 데이터 세트☆50Updated 2 years ago
- Korean-English Bilingual Electra Models☆110Updated 4 years ago
- KoGPT2 on Huggingface Transformers☆33Updated 4 years ago
- ELECTRA기반 한국어 대화체 언어모델☆54Updated 4 years ago
- Korean Math Word Problems☆59Updated 3 years ago
- ☆29Updated 8 years ago
- Korean Relation Extraction Gold Standard☆35Updated 4 years ago
- 한국어 T5 모델☆55Updated 3 years ago
- Data Augmentation Toolkit for Korean text.☆52Updated 4 years ago
- 한국어 개체명 정의 및 표지 표준화 기술보고서와 이를 기반으로 제작된 개체명 형태소 말뭉치☆94Updated 4 years ago