kakaobrain / kor-nlu-datasets
KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding
β299Updated last year
Related projects β
Alternatives and complementary repositories for kor-nlu-datasets
- Summarization module based on KoBARTβ197Updated last year
- KoBERT on π€ Huggingface Transformers π€ (with Bug Fixed)β203Updated 3 months ago
- π€ Korean Comments ELECTRA: νκ΅μ΄ λκΈλ‘ νμ΅ν ELECTRA λͺ¨λΈβ240Updated 2 years ago
- π¦ Pretrained BigBird Model for Korean (up to 4096 tokens)β202Updated 10 months ago
- Korean Language Modelμ μ΄μ©ν μ¬λ¦¬μλ΄ λν μΈμ΄ λͺ¨λΈβ210Updated last year
- OOVμμ΄ λΉ λ₯΄κ³ μ νν νκ΅μ΄ Embedding λΌμ΄λΈλ¬λ¦¬β220Updated 6 years ago
- β194Updated last year
- Sentence Embeddings using Siamese ETRI KoBERT-Networksβ162Updated last year
- π€ Pretrained BERT model & WordPiece tokenizer trained on Korean Comments νκ΅μ΄ λκΈλ‘ ν리νΈλ μ΄λν BERT λͺ¨λΈκ³Ό λ°μ΄ν°μ β476Updated 2 years ago
- π Korean NLU Benchmarkβ565Updated 2 years ago
- KBκ΅λ―Όμνμμ μ 곡νλ κ²½μ /κΈμ΅ λλ©μΈμ νΉνλ νκ΅μ΄ ALBERT λͺ¨λΈβ229Updated 3 years ago
- Korean HateSpeech Datasetβ375Updated 4 years ago
- Distillation of KoBERT from SKTBrain (Lightweight KoBERT)β187Updated last year
- π₯ Korean GPT-2, KoGPT2 FineTuning cased. νκ΅μ΄ κ°μ¬ λ°μ΄ν° νμ΅ π₯β230Updated 3 months ago
- This repository provides list of Korean NLP papers.β205Updated 4 years ago
- Jiphyeonjeon Season 2β121Updated 2 years ago
- κ΅λ΄ μμ°μ΄ μ²λ¦¬ κΈ°μ μ μ°κ΅¬ λ° κ°λ°νλ μ€ννΈμ λͺ©λ‘β165Updated 4 years ago
- Jiphyeonjeon Season 1β179Updated 3 years ago
- Split Korean text into sentences using heuristic algorithm.β210Updated 3 years ago
- Korean BARTβ447Updated last month
- NER Task with KoBERT (with Naver NLP Challenge dataset)β98Updated last year
- Chatbot_data_for_Koreanβ355Updated last year
- Pretrained ELECTRA Model for Koreanβ603Updated 9 months ago
- ν μ€νΈ μμ½ λΆμΌμ μ£Όμ μ°κ΅¬ μ£Όμ , Must-read Papers, μ΄μ© κ°λ₯ν model λ° data λ±μ μΆμ² μλ£μ ν¨κ» μ 리ν μ μ₯μμ λλ€.β334Updated 2 years ago
- Sentence Embeddings using Siamese SKT KoBERT-Networksβ134Updated last year
- ν μνλ‘2μ λ¨Έμ λ¬λμΌλ‘ μμνλ μμ°μ΄μ²λ¦¬ (λ‘μ§μ€ν±νκ·λΆν° BERTμ GPT3κΉμ§) μ€μ΅μλ£β276Updated 2 years ago
- EDAλ₯Ό νκ΅μ΄ λ°μ΄ν°μμλ μ¬μ©ν μ μλλ‘ WordNetμ μΆκ°β106Updated 4 years ago
- κ°μΈμ μΌλ‘ μμ§ν νκ΅μ΄ NLPμ© λ§λμΉ λͺ¨μβ132Updated 4 years ago
- νκ΅μ΄ μλ² λ© (Sentence Embeddings Using Korean Corpora)β455Updated 2 years ago
- List of Korean pre-trained language models.β189Updated last year