jonghwanhyeon / namu-wiki-extractor
A library to extract plaintexts from the JSON dump file of namu wiki
☆25Updated 2 years ago
Alternatives and similar repositories for namu-wiki-extractor:
Users that are interested in namu-wiki-extractor are comparing it to the libraries listed below
- Dataset of Korean Threatening Conversations☆69Updated 2 years ago
- KoBART chatbot☆47Updated 3 years ago
- 최신 자연어처리 모델 소개☆75Updated 2 years ago
- APEACH: Attacking Pejorative Expressions with Analysis on Crowd-generated Hate Speech Evaluation Datasets☆75Updated last year
- bpe based korean t5 model for text-to-text unified framework☆63Updated 9 months ago
- Kobart model on Huggingface transformers☆63Updated 2 years ago
- Training Transformers of Huggingface with KoNLPy☆68Updated 4 years ago
- #Paired Question☆23Updated 4 years ago
- Parallel dataset of Korean Questions and Commands☆59Updated last year
- BERTScore for Korean☆73Updated 11 months ago
- Language Style과 감정에 따른 챗봇 답변 변화 모델☆33Updated 3 years ago
- 🤗 최소한의 세팅으로 LM을 학습하기 위한 샘플코드☆57Updated last year
- This repository contains Korean Hate Speech dataset for paper, "K-MHaS: A Multi-label Hate Speech Detection Dataset in Korean Online News…☆40Updated 8 months ago
- Wikitext format dataset of Namuwiki (Most famous Korean wikipedia)☆51Updated 4 years ago
- Intonation-aided intention identification for Korean☆85Updated 2 years ago
- ☆15Updated 3 years ago
- Python wrapper for KoalaNLP (Korean NLP with Java/Scala)☆31Updated 7 months ago
- Automatic Korean word spacing with neural n-gram detector(NND)☆39Updated 4 years ago
- 한국어 언어 모델 학습을 위한 프로젝트(Flax, Pytorch with Huggingface Accelerate)☆31Updated last year
- KoGPT2 on Huggingface Transformers☆33Updated 3 years ago
- A utility for storing and reading files for Korean LM training 💾☆37Updated last year
- huggingface를 이용하여 downstream task 수행하기☆64Updated 3 years ago
- Korean Math Word Problems☆57Updated 3 years ago
- 한국어 중의성 해소 평가 데이터 세트☆47Updated last year
- 세종 말뭉치 데이터를 정제하기 위한 utils☆36Updated 5 years ago
- Korean Relation Extraction Gold Standard☆36Updated 3 years ago
- Kiwi 형태소 분석기를 활용한 딥러닝 언어 모델 실험실☆50Updated last year
- 나무위키덤프에서 정제된 텍스트를 얻기 위한 NamuwikiExtractor☆16Updated 2 years ago
- GPT-2 pretrained on Korean datasets.☆54Updated 3 years ago
- Korean Light Weight Language Model☆30Updated last year