jonghwanhyeon / namu-wiki-extractor
A library to extract plaintexts from the JSON dump file of namu wiki
☆26Updated 2 years ago
Alternatives and similar repositories for namu-wiki-extractor:
Users that are interested in namu-wiki-extractor are comparing it to the libraries listed below
- Parallel dataset of Korean Questions and Commands☆60Updated 2 years ago
- #Paired Question☆23Updated 4 years ago
- bpe based korean t5 model for text-to-text unified framework☆62Updated last year
- This repository contains Korean Hate Speech dataset for paper, "K-MHaS: A Multi-label Hate Speech Detection Dataset in Korean Online News…☆44Updated 11 months ago
- BERTScore for Korean☆77Updated last year
- APEACH: Attacking Pejorative Expressions with Analysis on Crowd-generated Hate Speech Evaluation Datasets☆76Updated 2 years ago
- 나무위키덤프에서 정제된 텍스트를 얻기 위한 NamuwikiExtractor☆18Updated 3 years ago
- KoBART chatbot☆47Updated 3 years ago
- Dataset of Korean Threatening Conversations☆71Updated 2 years ago
- 🤗 최소한의 세팅으로 LM을 학습하기 위한 샘플코드☆58Updated last year
- Kobart model on Huggingface transformers☆63Updated 3 years ago
- 한국어 높임말 교정☆26Updated 2 years ago
- Bias, Hate classification with KoELECTRA 👿☆27Updated last year
- 최신 자연어처리 모델 소개☆75Updated 2 years ago
- Kiwi 형태소 분석기를 활용한 딥러닝 언어 모델 실험실☆52Updated last year
- Korean Relation Extraction Gold Standard☆35Updated 3 years ago
- Training Transformers of Huggingface with KoNLPy☆68Updated 4 years ago
- ☆73Updated 3 years ago
- 한국어 T5 모델☆51Updated 3 years ago
- 한국어 언어 모델 학습을 위한 프로젝트(Flax, Pytorch with Huggingface Accelerate)☆32Updated last year
- KoRean based SBERT pre-trained models (KR-SBERT) for PyTorch☆97Updated 2 years ago
- KoGPT2 on Huggingface Transformers☆33Updated 3 years ago
- Automatic Korean word spacing with neural n-gram detector(NND)☆39Updated 5 years ago
- ☆14Updated 3 years ago
- CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean☆45Updated 4 months ago
- 특허분야 특화된 한국어 AI언어모델 KorPatBERT☆62Updated last year
- KOLD: Korean Offensive Language Dataset☆80Updated 2 years ago
- 한국어 중의성 해소 평가 데이터 세트☆48Updated last year
- 한국어 악성댓글 데이터셋☆73Updated 4 years ago
- 한국어 심리 상담 데이터셋☆78Updated last year