cynthia / kosentences
Large-scale unannotated Korean corpus for unsupervised tasks (e.g., language modeling).
☆27 Updated 5 years ago
Alternatives and similar repositories for kosentences:
Users who are interested in kosentences are comparing it to the repositories listed below.
- Universal Dependency Treebanks in Korean ☆37 Updated 3 years ago
- Real-time automatic word segmentation (for user-generated texts) ☆21 Updated 2 years ago
- TEMP ☆34 Updated 5 years ago
- Meetup every Thursday at 20:00 ☆16 Updated 4 years ago
- Adds noise to Korean documents. ☆27 Updated 2 years ago
- Transformers Pipeline with KoELECTRA ☆40 Updated last year
- Structured argument extraction for Korean ☆22 Updated 3 years ago
- Yaja Time (a.k.a. late-night natural language processing time) ☆27 Updated 4 years ago
- Data from KAIST (a Korean treebank). ☆19 Updated 5 months ago
- Python wrapper for KoalaNLP (Korean NLP with Java/Scala) ☆31 Updated 10 months ago
- Bias, Hate classification with KoELECTRA 👿 ☆27 Updated last year
- MeCab model trained with OpenKorPos. ☆23 Updated 2 years ago
- 🦛 Python library for processing Hangul. Python Korean Morphological Analyzer ☆20 Updated 2 months ago
- Korean Wikipedia corpus segmented into sentences. Download it from Releases or use it through tfds-korean. ☆24 Updated last year
- Adversarial Test Dataset for Korean Multi-turn Response Selection ☆34 Updated 3 years ago
- Parallel dataset of Korean Questions and Commands ☆60 Updated 2 years ago
- Korean Nested Named Entity Corpus ☆18 Updated last year
- Similar string search in Levenshtein distance ☆21 Updated 3 years ago
- A dataset of 50 articles selected from the IT/science section of Naver News, with the sentences that serve as the summary tagged. ☆39 Updated 8 years ago
- Korean Movie Review Emotion (KMRE) Dataset ☆21 Updated 4 years ago
- KU_NERDY, Dongyub Lee and Heuiseok Lim (gold prize, 2017 Korean Language Information Processing System Competition) - Conference on Hangul and Korean Information Processing ☆32 Updated 6 years ago
- Korean ALBERT ☆47 Updated 5 years ago
- A utility for storing and reading files for Korean LM training 💾 ☆36 Updated last year
- Korean version of GoEmotions Dataset 😍😢😱 ☆54 Updated last year
- Archive of Blue House national petition data ☆15 Updated 4 years ago
- KorQuAD (Korean Question Answering Dataset) submission guide using PyTorch pretrained BERT ☆31 Updated 5 years ago
- Easy installer of kocohub dataset ☆24 Updated 4 years ago
- Korean predicate analyzer (lemmatization, morphological analysis of predicates) ☆44 Updated 5 years ago
- Prosody-semantics Interface in Seoul Korean ☆12 Updated 4 years ago
- Korean morphological analyzer ☆26 Updated 5 years ago