kakaobrain / kortokLinks
The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)
☆119Updated 4 years ago
Alternatives and similar repositories for kortok
Users that are interested in kortok are comparing it to the libraries listed below
Sorting:
- Character-level Korean ELECTRA Model (음절 단위 한국어 ELECTRA)☆54Updated 2 years ago
- Kobart model on Huggingface transformers☆64Updated 3 years ago
- Korean-English Bilingual Electra Models☆110Updated 3 years ago
- Dataset of Korean Threatening Conversations☆74Updated 2 years ago
- BERTScore for Korean☆81Updated last year
- Training Transformers of Huggingface with KoNLPy☆68Updated 4 years ago
- Finetuning Pipeline☆90Updated 3 years ago
- Adversarial Test Dataset for Korean Multi-turn Response Selection☆35Updated 3 years ago
- 11.5기의 beyondBERT의 토론 내용을 정리하는 repository입니다.☆58Updated 5 years ago
- Parallel dataset of Korean Questions and Commands☆61Updated 2 years ago
- huggingface를 이용하여 downstream task 수행하기☆64Updated 3 years ago
- 나무위키덤프에서 정제된 텍스트를 얻기 위한 NamuwikiExtractor☆18Updated 3 years ago
- #Paired Question☆24Updated 5 years ago
- Korean Relation Extraction Gold Standard☆35Updated 4 years ago
- Korean Math Word Problems☆59Updated 3 years ago
- Yet another python binding for mecab-ko☆86Updated 2 years ago
- KoBART chatbot☆47Updated 4 years ago
- ☆60Updated last year
- question generation model with KorQuAD dataset☆38Updated 3 years ago
- Transformers Pipeline with KoELECTRA☆40Updated 2 years ago
- Open Korean NLP Dataset Curation for the Users All Around the Globe☆152Updated last year
- ELECTRA기반 한국어 대화체 언어모델☆54Updated 4 years ago
- Korean ALBERT☆47Updated 5 years ago
- KOLD: Korean Offensive Language Dataset☆81Updated 2 years ago
- APEACH: Attacking Pejorative Expressions with Analysis on Crowd-generated Hate Speech Evaluation Datasets☆77Updated 2 years ago
- Korean Easy Data Augmentation☆94Updated 3 years ago
- Intonation-aided intention identification for Korean☆84Updated 2 years ago
- Baseline code for Korean open domain question answering(ODQA)☆76Updated 2 years ago
- Data Augmentation Toolkit for Korean text.☆52Updated 3 years ago
- Simple Contrastive Learning of Korean Sentence Embeddings☆52Updated 2 years ago