kakaobrain / kortokLinks
The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)
☆119Updated 4 years ago
Alternatives and similar repositories for kortok
Users that are interested in kortok are comparing it to the libraries listed below
Sorting:
- Kobart model on Huggingface transformers☆64Updated 3 years ago
- Character-level Korean ELECTRA Model (음절 단위 한국어 ELECTRA)☆54Updated 2 years ago
- Korean-English Bilingual Electra Models☆110Updated 3 years ago
- Training Transformers of Huggingface with KoNLPy☆68Updated 4 years ago
- Dataset of Korean Threatening Conversations☆74Updated 2 years ago
- Korean Math Word Problems☆59Updated 3 years ago
- Adversarial Test Dataset for Korean Multi-turn Response Selection☆35Updated 3 years ago
- 11.5기의 beyondBERT의 토론 내용을 정리하는 repository입니다.☆58Updated 5 years ago
- Finetuning Pipeline☆90Updated 3 years ago
- Parallel dataset of Korean Questions and Commands☆61Updated 2 years ago
- KoBART chatbot☆47Updated 4 years ago
- huggingface를 이용하여 downstream task 수행하기☆64Updated 3 years ago
- #Paired Question☆24Updated 5 years ago
- BERTScore for Korean☆80Updated last year
- Yet another python binding for mecab-ko☆86Updated 2 years ago
- Korean Relation Extraction Gold Standard☆35Updated 4 years ago
- Open Korean NLP Dataset Curation for the Users All Around the Globe☆150Updated last year
- 나무위키덤프에서 정제된 텍스트를 얻기 위한 NamuwikiExtractor☆18Updated 3 years ago
- ELECTRA기반 한국어 대화체 언어모델☆54Updated 3 years ago
- 한국어 개체명 정의 및 표지 표준화 기술보고서와 이를 기반으로 제작된 개체명 형태소 말뭉치☆91Updated 4 years ago
- Korean ALBERT☆47Updated 5 years ago
- ☆60Updated last year
- Transformers Pipeline with KoELECTRA☆40Updated 2 years ago
- Data Augmentation Toolkit for Korean text.☆51Updated 3 years ago
- APEACH: Attacking Pejorative Expressions with Analysis on Crowd-generated Hate Speech Evaluation Datasets☆77Updated 2 years ago
- KOLD: Korean Offensive Language Dataset☆81Updated 2 years ago
- ☆29Updated 7 years ago
- Pecab: Pure python Korean morpheme analyzer based on Mecab☆168Updated last year
- KoGPT2 on Huggingface Transformers☆33Updated 4 years ago
- Wikitext format dataset of Namuwiki (Most famous Korean wikipedia)☆51Updated 4 years ago