🤗 Pretrained BERT model & WordPiece tokenizer trained on Korean comments (a BERT model pretrained on Korean comments, and its dataset)
★496 · Nov 7, 2022 · Updated 3 years ago
Alternatives and similar repositories for KcBERT
Users who are interested in KcBERT are comparing it to the repositories listed below.
- Pretrained ELECTRA Model for Korean ★631 · Feb 19, 2024 · Updated 2 years ago
- Korean corpus repository ★743 · Oct 3, 2022 · Updated 3 years ago
- 🤗 Korean Comments ELECTRA: an ELECTRA model trained on Korean comments ★261 · Nov 7, 2022 · Updated 3 years ago
- Links to Korean datasets ★905 · Oct 14, 2024 · Updated last year
- KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding ★310 · Jul 9, 2023 · Updated 2 years ago
- Korean HateSpeech Dataset ★394 · Jul 18, 2020 · Updated 5 years ago
- Korean BERT pre-trained cased (KoBERT) ★1,402 · Jun 14, 2025 · Updated 8 months ago
- Korean GPT-2 pretrained cased (KoGPT2) ★560 · Oct 3, 2024 · Updated last year
- Distillation of KoBERT from SKTBrain (Lightweight KoBERT) ★197 · Sep 6, 2023 · Updated 2 years ago
- Korean wellness chatbot models: KoGPT2 + KoBERT/KoELECTRA (PyTorch, Transformers). ★209 · Jan 12, 2026 · Updated last month
- Korean ALBERT model specialized for the economics/finance domain, provided by KB Kookmin Bank ★241 · Oct 7, 2021 · Updated 4 years ago
- Korean NLU Benchmark ★587 · Jul 6, 2022 · Updated 3 years ago
- KoRean based BERT pre-trained models (KR-BERT) for Tensorflow and PyTorch ★212 · Apr 24, 2024 · Updated last year
- Pretrained Language Models for Korean ★399 · Jan 1, 2023 · Updated 3 years ago
- A Python library for Korean natural language processing. Provides word extraction, tokenization, part-of-speech tagging, and preprocessing. ★985 · May 7, 2025 · Updated 9 months ago
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020) ★119 · Oct 8, 2020 · Updated 5 years ago
- Korean Embeddings (Sentence Embeddings Using Korean Corpora) ★468 · Dec 1, 2021 · Updated 4 years ago
- Naver sentiment movie corpus ★598 · Mar 7, 2017 · Updated 8 years ago
- Korean BART ★464 · Jun 14, 2025 · Updated 8 months ago
- KoBERT on 🤗 Huggingface Transformers 🤗 (with Bug Fixed) ★212 · Aug 21, 2024 · Updated last year
- Split Korean text into sentences using heuristic algorithm. ★214 · Dec 24, 2020 · Updated 5 years ago
- Korean named entity recognizer built with KoBERT and CRF (BERT+CRF based Named Entity Recognition model for Korean) ★504 · Feb 11, 2024 · Updated 2 years ago
- Training Transformers of Huggingface with KoNLPy ★68 · Aug 28, 2020 · Updated 5 years ago
- KakaoBrain KoGPT (Korean Generative Pre-trained Transformer) ★1,014 · Jan 30, 2024 · Updated 2 years ago
- Open Korean NLP Dataset Curation for the Users All Around the Globe ★152 · Nov 18, 2023 · Updated 2 years ago
- Sentence Embeddings using Siamese ETRI KoBERT ★163 · Aug 16, 2025 · Updated 6 months ago
- Simple Chit-Chat based on KoGPT2 ★182 · Jun 12, 2023 · Updated 2 years ago
- Implementing nlp papers relevant to classification with PyTorch, gluonnlp ★230 · Dec 8, 2022 · Updated 3 years ago
- Chatbot_data_for_Korean ★358 · Mar 30, 2023 · Updated 2 years ago
- KSS: Korean String processing Suite ★468 · Nov 13, 2025 · Updated 3 months ago
- PORORO: Platform Of neuRal mOdels for natuRal language prOcessing ★1,307 · Mar 23, 2022 · Updated 3 years ago
- KoAlpaca: An open-source language model to understand Korean instructions ★1,576 · Oct 25, 2024 · Updated last year
- KcBERT/KcELECTRA Fine Tune Benchmarks code (forked from https://github.com/monologg/KoELECTRA/tree/master/finetune) ★47 · Apr 10, 2022 · Updated 3 years ago
- Pretrained BigBird Model for Korean (up to 4096 tokens) ★201 · Dec 28, 2023 · Updated 2 years ago
- ★442 · Apr 8, 2022 · Updated 3 years ago
- A personally collected set of corpora for Korean NLP ★139 · Sep 15, 2020 · Updated 5 years ago
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets ★130 · Nov 12, 2022 · Updated 3 years ago
- ☁️ Gureum (KULLM): an LLM specialized for Korean, developed at Korea University ★589 · May 1, 2024 · Updated last year
- List of Korean startups researching and developing natural language processing technology ★165 · May 10, 2020 · Updated 5 years ago