Korean corpus repository
☆747Oct 3, 2022Updated 3 years ago
Alternatives and similar repositories for Korpora
Users that are interested in Korpora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 한국어 데이터 세트 링크☆910Oct 14, 2024Updated last year
- 🤗 Pretrained BERT model & WordPiece tokenizer trained on Korean Comments 한국어 댓글로 프리트레이닝한 BERT 모델과 데이터셋☆492Nov 7, 2022Updated 3 years ago
- KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding☆310Jul 9, 2023Updated 2 years ago
- Korean GPT-2 pretrained cased (KoGPT2)☆558Oct 3, 2024Updated last year
- Pretrained ELECTRA Model for Korean☆633Feb 19, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Open Korean NLP Dataset Curation for the Users All Around the Globe☆154Nov 18, 2023Updated 2 years ago
- 📖 Korean NLU Benchmark☆594Jul 6, 2022Updated 3 years ago
- KB국민은행에서 제공하는 경제/금융 도메인에 특화된 한국어 ALBERT 모델☆240Oct 7, 2021Updated 4 years ago
- Korean HateSpeech Dataset☆396Jul 18, 2020Updated 5 years ago
- Korean BART☆465Jun 14, 2025Updated 10 months ago
- KSS: Korean String processing Suite☆470Nov 13, 2025Updated 5 months ago
- KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)☆1,015Jan 30, 2024Updated 2 years ago
- Pretrained Language Models for Korean☆394Jan 1, 2023Updated 3 years ago
- 한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다.☆984Mar 10, 2026Updated last month
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Korean wellness chatbot models: KoGPT2 + KoBERT/KoELECTRA (PyTorch, Transformers).☆209Jan 12, 2026Updated 3 months ago
- Korean BERT pre-trained cased (KoBERT)☆1,408Jun 14, 2025Updated 10 months ago
- 🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)☆202Dec 28, 2023Updated 2 years ago
- KoRean based BERT pre-trained models (KR-BERT) for Tensorflow and PyTorch☆211Apr 24, 2024Updated last year
- Chatbot_data_for_Korean☆359Mar 30, 2023Updated 3 years ago
- 🤗 Korean Comments ELECTRA: 한국어 댓글로 학습한 ELECTRA 모델☆261Nov 7, 2022Updated 3 years ago
- Split Korean text into sentences using heuristic algorithm.☆216Dec 24, 2020Updated 5 years ago
- PORORO: Platform Of neuRal mOdels for natuRal language prOcessing☆1,305Mar 23, 2022Updated 4 years ago
- Distillation of KoBERT from SKTBrain (Lightweight KoBERT)☆199Sep 6, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 한국어 임베딩 (Sentence Embeddings Using Korean Corpora)☆468Dec 1, 2021Updated 4 years ago
- Naver sentiment movie corpus☆601Mar 7, 2017Updated 9 years ago
- 언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.☆456Apr 13, 2025Updated last year
- KoBERT on 🤗 Huggingface Transformers 🤗 (with Bug Fixed)☆211Aug 21, 2024Updated last year
- ☆442Apr 8, 2022Updated 4 years ago
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)☆119Oct 8, 2020Updated 5 years ago
- NLP Shared tasks (NER, SRL) using NSML☆184Jan 3, 2019Updated 7 years ago
- Simple Chit-Chat based on KoGPT2☆183Jun 12, 2023Updated 2 years ago
- Kiwi(지능형 한국어 형태소 분석기)☆706Apr 4, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- KoBERT와 CRF로 만든 한국어 개체명인식기 (BERT+CRF based Named Entity Recognition model for Korean)☆505Feb 11, 2024Updated 2 years ago
- Training Transformers of Huggingface with KoNLPy☆68Aug 28, 2020Updated 5 years ago
- 🔥 Korean GPT-2, KoGPT2 FineTuning cased. 한국어 가사 데이터 학습 🔥☆225Apr 29, 2025Updated 11 months ago
- Automatic Korean word spacing with Python☆425Jul 4, 2024Updated last year
- ☁️ 구름(KULLM): 고려대학교에서 개발한, 한국어에 특화된 LLM☆587May 1, 2024Updated last year
- Finetuning Pipeline☆89Feb 25, 2022Updated 4 years ago
- Kobart model on Huggingface transformers☆64Feb 15, 2022Updated 4 years ago