π€ Pretrained BERT model & WordPiece tokenizer trained on Korean Comments νκ΅μ΄ λκΈλ‘ ν리νΈλ μ΄λν BERT λͺ¨λΈκ³Ό λ°μ΄ν°μ
β494Nov 7, 2022Updated 3 years ago
Alternatives and similar repositories for KcBERT
Users that are interested in KcBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π€ Korean Comments ELECTRA: νκ΅μ΄ λκΈλ‘ νμ΅ν ELECTRA λͺ¨λΈβ261Nov 7, 2022Updated 3 years ago
- Pretrained ELECTRA Model for Koreanβ630Feb 19, 2024Updated 2 years ago
- Korean corpus repositoryβ745Oct 3, 2022Updated 3 years ago
- νκ΅μ΄ λ°μ΄ν° μΈνΈ λ§ν¬β908Oct 14, 2024Updated last year
- KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understandingβ310Jul 9, 2023Updated 2 years ago
- Korean HateSpeech Datasetβ395Jul 18, 2020Updated 5 years ago
- Korean BERT pre-trained cased (KoBERT)β1,408Jun 14, 2025Updated 9 months ago
- Korean GPT-2 pretrained cased (KoGPT2)β559Oct 3, 2024Updated last year
- Distillation of KoBERT from SKTBrain (Lightweight KoBERT)β198Sep 6, 2023Updated 2 years ago
- π Korean NLU Benchmarkβ589Jul 6, 2022Updated 3 years ago
- KBκ΅λ―Όμνμμ μ 곡νλ κ²½μ /κΈμ΅ λλ©μΈμ νΉνλ νκ΅μ΄ ALBERT λͺ¨λΈβ240Oct 7, 2021Updated 4 years ago
- KoRean based BERT pre-trained models (KR-BERT) for Tensorflow and PyTorchβ211Apr 24, 2024Updated last year
- KcBERT/KcELECTRA Fine Tune Benchmarks code (forked from https://github.com/monologg/KoELECTRA/tree/master/finetune)β47Apr 10, 2022Updated 3 years ago
- Korean wellness chatbot models: KoGPT2 + KoBERT/KoELECTRA (PyTorch, Transformers).β209Jan 12, 2026Updated 2 months ago
- νκ΅μ΄ μμ°μ΄μ²λ¦¬λ₯Ό μν νμ΄μ¬ λΌμ΄λΈλ¬λ¦¬μ λλ€. λ¨μ΄ μΆμΆ/ ν ν¬λμ΄μ / νμ¬νλ³/ μ μ²λ¦¬μ κΈ°λ₯μ μ 곡ν©λλ€.β984Mar 10, 2026Updated last week
- Pretrained Language Models for Koreanβ398Jan 1, 2023Updated 3 years ago
- Naver sentiment movie corpusβ600Mar 7, 2017Updated 9 years ago
- KoBERTμ CRFλ‘ λ§λ νκ΅μ΄ κ°μ²΄λͺ μΈμκΈ° (BERT+CRF based Named Entity Recognition model for Korean)β504Feb 11, 2024Updated 2 years ago
- KoBERT on π€ Huggingface Transformers π€ (with Bug Fixed)β211Aug 21, 2024Updated last year
- PORORO: Platform Of neuRal mOdels for natuRal language prOcessingβ1,306Mar 23, 2022Updated 4 years ago
- νκ΅μ΄ μλ² λ© (Sentence Embeddings Using Korean Corpora)β467Dec 1, 2021Updated 4 years ago
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)β119Oct 8, 2020Updated 5 years ago
- Open Korean NLP Dataset Curation for the Users All Around the Globeβ152Nov 18, 2023Updated 2 years ago
- Training Transformers of Huggingface with KoNLPyβ68Aug 28, 2020Updated 5 years ago
- Korean BARTβ465Jun 14, 2025Updated 9 months ago
- Implementing nlp papers relevant to classification with PyTorch, gluonnlpβ229Dec 8, 2022Updated 3 years ago
- Simple Chit-Chat based on KoGPT2β182Jun 12, 2023Updated 2 years ago
- Split Korean text into sentences using heuristic algorithm.β215Dec 24, 2020Updated 5 years ago
- KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)β1,013Jan 30, 2024Updated 2 years ago
- Sentence Embeddings using Siamese ETRI KoBERTβ162Aug 16, 2025Updated 7 months ago
- KoAlpaca: νκ΅μ΄ λͺ λ Ήμ΄λ₯Ό μ΄ν΄νλ μ€νμμ€ μΈμ΄λͺ¨λΈ (KoAlpaca: An open-source language model to understand Korean instructions)β1,576Oct 25, 2024Updated last year
- Chatbot_data_for_Koreanβ358Mar 30, 2023Updated 2 years ago
- KSS: Korean String processing Suiteβ470Nov 13, 2025Updated 4 months ago
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasetsβ130Nov 12, 2022Updated 3 years ago
- π¦ Pretrained BigBird Model for Korean (up to 4096 tokens)β202Dec 28, 2023Updated 2 years ago
- β443Apr 8, 2022Updated 3 years ago
- κ°μΈμ μΌλ‘ μμ§ν νκ΅μ΄ NLPμ© λ§λμΉ λͺ¨μβ140Sep 15, 2020Updated 5 years ago
- β© All about Korean Transformers (information and tutorial)β18Jun 21, 2022Updated 3 years ago
- List of Korean pre-trained language models.β188Aug 31, 2023Updated 2 years ago