KSS: Korean String processing Suite
☆470Nov 13, 2025Updated 4 months ago
Alternatives and similar repositories for kss
Users that are interested in kss are comparing it to the libraries listed below.
- Yet another python binding for mecab-ko☆88May 16, 2023Updated 2 years ago
- Pecab: Pure python Korean morpheme analyzer based on Mecab☆172Apr 27, 2024Updated last year
- Split Korean text into sentences using heuristic algorithm.☆215Dec 24, 2020Updated 5 years ago
- Korean corpus repository☆747Oct 3, 2022Updated 3 years ago
- Pretrained Language Models for Korean☆398Jan 1, 2023Updated 3 years ago
- 🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)☆202Dec 28, 2023Updated 2 years ago
- KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding☆310Jul 9, 2023Updated 2 years ago
- ☆197May 22, 2023Updated 2 years ago
- Pretrained ELECTRA Model for Korean☆631Feb 19, 2024Updated 2 years ago
- 📖 Korean NLU Benchmark☆589Jul 6, 2022Updated 3 years ago
- Kobart model on Huggingface transformers☆64Feb 15, 2022Updated 4 years ago
- Finetuning Pipeline☆89Feb 25, 2022Updated 4 years ago
- Automatic Korean word spacing with Python☆426Jul 4, 2024Updated last year
- PORORO: Platform Of neuRal mOdels for natuRal language prOcessing☆1,306Mar 23, 2022Updated 4 years ago
- Links to Korean datasets☆910Oct 14, 2024Updated last year
- Python API for Kiwi☆367Mar 18, 2026Updated last week
- Korean GPT-2 pretrained cased (KoGPT2)☆558Oct 3, 2024Updated last year
- Training Transformers of Huggingface with KoNLPy☆68Aug 28, 2020Updated 5 years ago
- 🤗 Pretrained BERT model, WordPiece tokenizer, and dataset trained on Korean comments☆493Nov 7, 2022Updated 3 years ago
- Korean-English Bilingual Electra Models☆110Nov 22, 2021Updated 4 years ago
- Korean Named Entity Corpus☆25May 12, 2023Updated 2 years ago
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)☆119Oct 8, 2020Updated 5 years ago
- #Paired Question☆24Jun 16, 2020Updated 5 years ago
- KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)☆1,013Jan 30, 2024Updated 2 years ago
- A Python library for Korean natural language processing. Provides word extraction, tokenization, part-of-speech tagging, and preprocessing.☆985Mar 10, 2026Updated 2 weeks ago
- 🤗 Korean Comments ELECTRA: ELECTRA model trained on Korean comments☆261Nov 7, 2022Updated 3 years ago
- Curation note of NLP datasets☆98Dec 6, 2022Updated 3 years ago
- Polyglot: Large Language Models of Well-balanced Competence in Multi-languages☆484Aug 22, 2023Updated 2 years ago
- Korean HateSpeech Dataset☆395Jul 18, 2020Updated 5 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 5 months ago
- BERTScore for Korean☆80Feb 22, 2024Updated 2 years ago
- A collection of public Korean instruction datasets for training language models.☆455Apr 13, 2025Updated 11 months ago
- HuggingFace Transformers tutorial using KLUE data☆129Jun 28, 2021Updated 4 years ago
- ☁️ 구름 (KULLM): an LLM specialized for Korean, developed by Korea University☆588May 1, 2024Updated last year
- A Korean sentence spacing (deletion/insertion) model. Written so that you can train it yourself after preparing the data.☆57Jul 11, 2022Updated 3 years ago
- Korean BART☆465Jun 14, 2025Updated 9 months ago
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets☆130Nov 12, 2022Updated 3 years ago
- Beyond LM: How can language model go forward in the future?☆15Apr 30, 2023Updated 2 years ago
- This repo is for Korean wiki table question answering datasets described in the paper of Korean-Specific Dataset for Table Question Answe…☆91Oct 22, 2024Updated last year