jeongukjae / korean-wikipedia-corpus
A Korean Wikipedia corpus segmented into sentences. Download it from Releases or use it through tfds-korean.
☆24 Updated last year
Alternatives and similar repositories for korean-wikipedia-corpus:
Users who are interested in korean-wikipedia-corpus are comparing it to the repositories listed below:
- KoBART chatbot ☆47 Updated 3 years ago
- ☆20 Updated 3 years ago
- A utility for storing and reading files for Korean LM training 💾 ☆36 Updated last year
- KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch ☆14 Updated 3 years ago
- Wikitext format dataset of Namuwiki (Most famous Korean wikipedia) ☆51 Updated 4 years ago