jeongukjae / tfds-korean
A collection of Korean Text Datasets ready to use using Tensorflow-Datasets.
☆20Updated 2 years ago
Alternatives and similar repositories for tfds-korean:
Users that are interested in tfds-korean are comparing it to the libraries listed below
- Transformers Pipeline with KoELECTRA☆40Updated last year
- Korean ALBERT☆47Updated 5 years ago
- Training Transformers of Huggingface with KoNLPy☆68Updated 4 years ago
- KoGPT2 on Huggingface Transformers☆33Updated 3 years ago
- 네이버 뉴스 중 IT/과학 분야에서 50개를 선정해서 요약에 해당하는 문장을 태깅해둔 데이터셋입니다.☆39Updated 8 years ago
- Kobart model on Huggingface transformers☆63Updated 3 years ago
- 한국어 문서에 노이즈를 추가합니다.☆27Updated 2 years ago
- Adversarial Test Dataset for Korean Multi-turn Response Selection☆34Updated 3 years ago
- A utility for storing and reading files for Korean LM training 💾☆36Updated last year
- ☆20Updated 3 years ago
- KoBART chatbot☆47Updated 3 years ago
- 딥러닝에 필요한 데이터를 인터넷에서 크롤링하기 위한 기능들을 모음 입니다.☆28Updated 5 years ago
- ☆26Updated 3 years ago
- Similar string search in Levenshtein distance☆21Updated 3 years ago
- [HCLT 2022] Korean sentence text similarity dataset using naver shopping review☆25Updated 2 years ago
- Parallel dataset of Korean Questions and Commands☆60Updated 2 years ago
- Wikitext format dataset of Namuwiki (Most famous Korean wikipedia)☆51Updated 4 years ago
- Easy installer of kocohub dataset☆24Updated 4 years ago
- #Paired Question☆23Updated 4 years ago
- Language Style과 감정에 따른 챗봇 답변 변화 모델☆33Updated 3 years ago
- 야자타임 (a.k.a. 야밤의 자연어처리 타임)☆27Updated 4 years ago
- Named Entity Recognition Model for Naver NLP Challenge 2018 : BiLSTM-CRF model based Korean named entity tagger☆14Updated 2 years ago
- Korean-English Bilingual Electra Models☆109Updated 3 years ago
- Korean version of GoEmotions Dataset 😍😢😱☆54Updated last year
- Automatic Korean word spacing with neural n-gram detector(NND)☆39Updated 5 years ago
- 문장단위로 분절된 한국어 위키피디아 코퍼스. Releases에서 다운로드 받거나 tfds-korean으로 사용해주세요.☆24Updated last year
- Guide KorQuAD upload to leaderboard (EM 68.947 / F1 88.468) model which only use BERT-multilingual(single)☆41Updated 5 years ago
- ELECTRA기반 한국어 대화체 언어모델☆54Updated 3 years ago
- Dataset of Korean Threatening Conversations☆71Updated 2 years ago
- Structured argument extraction for Korean☆22Updated 3 years ago