jeongukjae / tfds-korean
A collection of Korean text datasets, ready to use with TensorFlow Datasets.
☆20Updated 2 years ago
Related projects:
- Transformers Pipeline with KoELECTRA☆40Updated last year
- Korean ALBERT☆47Updated 4 years ago
- ☆26Updated 2 years ago
- KoGPT2 on Huggingface Transformers☆33Updated 3 years ago
- Easy installer of kocohub dataset☆24Updated 4 years ago
- Parallel dataset of Korean Questions and Commands☆59Updated last year
- Training Transformers of Huggingface with KoNLPy☆68Updated 4 years ago
- Korean version of GoEmotions Dataset 😍😢😱☆52Updated last year
- KoBART chatbot☆46Updated 3 years ago
- KorQuAD (Korean Question Answering Dataset) submission guide using PyTorch pretrained BERT☆31Updated 5 years ago
- A dataset of 50 Naver News articles selected from the IT/Science section, with the sentences that serve as the summary tagged.☆39Updated 7 years ago
- Bias, Hate classification with KoELECTRA 👿☆26Updated last year
- Kobart model on Huggingface transformers☆63Updated 2 years ago
- Guide to uploading a KorQuAD leaderboard submission (EM 68.947 / F1 88.468) with a model that uses only BERT-multilingual (single)☆41Updated 5 years ago
- Named Entity Recognition Model for Naver NLP Challenge 2018: a Korean named entity tagger based on a BiLSTM-CRF model☆14Updated last year
- Archive of Blue House (Cheong Wa Dae) national petition data☆15Updated 4 years ago
- Similar string search in Levenshtein distance☆22Updated 3 years ago
- A utility for storing and reading files for Korean LM training 💾☆36Updated 8 months ago
- NamuwikiExtractor, a tool for obtaining cleaned text from Namuwiki dumps☆16Updated 2 years ago
- Structured argument extraction for Korean☆22Updated 2 years ago
- Korean BERT model using character tokenizer☆27Updated 3 years ago
- Wikitext-format dataset of Namuwiki (the most famous Korean wiki)☆50Updated 3 years ago
- ☆20Updated 3 years ago
- Automatic Korean word spacing with a neural n-gram detector (NND)☆39Updated 4 years ago
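- Repository of study notes on 'Korean Embeddings' (한국어 임베딩), the natural language processing book by Lee Gichang (ratsgo) [DONE]☆22Updated 4 years ago
- Tokenizer comparison experiments☆11Updated 2 years ago
- Language model for the Soongsil University community☆40Updated 2 years ago
- Adds noise to Korean documents.☆27Updated last year
- Korean Wikipedia corpus segmented into sentences. Download it from Releases or use it via tfds-korean.☆24Updated last year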
- ☆21Updated 2 years ago