An integrated library for Korean language preprocessing.
☆204Apr 23, 2023Updated 3 years ago
Alternatives and similar repositories for hangul-utils
Users that are interested in hangul-utils are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 한글 자모 분리/조합 작업을 위한 툴킷☆297Nov 1, 2024Updated last year
- 한국어 악성댓글 데이터셋☆73Sep 26, 2020Updated 5 years ago
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)☆119Oct 8, 2020Updated 5 years ago
- Korean Parallel Corpus☆147Feb 24, 2024Updated 2 years ago
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets☆130Nov 12, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- https://ailabs.enliple.com/☆105Feb 25, 2021Updated 5 years ago
- Korean HateSpeech Dataset☆397Jul 18, 2020Updated 5 years ago
- Subword-level Word Vector Representations for Korean (ACL 2018)☆106Oct 17, 2019Updated 6 years ago
- KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding☆312Jul 9, 2023Updated 2 years ago
- Korean corpus repository☆748Oct 3, 2022Updated 3 years ago
- Convert Numerical Representations to Korean Pronunciation☆14Apr 20, 2020Updated 6 years ago
- Automatic Korean Hanja tagging tool powered by Hanjaro (hanjaro.juntong.or.kr)☆19Feb 22, 2019Updated 7 years ago
- 📖 Korean NLU Benchmark☆595Jul 6, 2022Updated 3 years ago
- A curated list of resources for NLP (Natural Language Processing) for Korean☆662Sep 18, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 개인적으로 수집한 한국어 NLP용 말뭉치 모음☆140Sep 15, 2020Updated 5 years ago
- 🤗 Pretrained BERT model & WordPiece tokenizer trained on Korean Comments 한국어 댓글로 프리트레이닝한 BERT 모델과 데이터셋☆491Nov 7, 2022Updated 3 years ago
- ☆198May 22, 2023Updated 3 years ago
- ☆11Oct 3, 2021Updated 4 years ago
- 한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다.☆984Mar 10, 2026Updated 3 months ago
- KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch☆15Feb 13, 2022Updated 4 years ago
- BERTScore for Korean☆80Feb 22, 2024Updated 2 years ago
- Split Korean text into sentences using heuristic algorithm.☆216Dec 24, 2020Updated 5 years ago
- KSS: Korean String processing Suite☆470Nov 13, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Open Korean NLP Dataset Curation for the Users All Around the Globe☆155Nov 18, 2023Updated 2 years ago
- Implementing nlp papers relevant to classification with PyTorch, gluonnlp☆229Dec 8, 2022Updated 3 years ago
- Simple extension of WikiExtractor(https://github.com/attardi/wikiextractor)☆16Dec 23, 2016Updated 9 years ago
- ☆25Oct 28, 2020Updated 5 years ago
- Korean sejong corpus download and simple analysis☆149May 9, 2019Updated 7 years ago
- Korean BART☆467Jun 14, 2025Updated 11 months ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 7 months ago
- Kobart model on Huggingface transformers☆64Feb 15, 2022Updated 4 years ago
- I hope to this list will contribute good influence in Korean online services.☆64Feb 10, 2019Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Intonation-aided intention identification for Korean☆83Nov 21, 2022Updated 3 years ago
- 2019 국어경진대회 한국어 의존구문 분석 대상(문체부 장관상)☆15Oct 26, 2022Updated 3 years ago
- 🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)☆202Dec 28, 2023Updated 2 years ago
- Data from KAIST (a Korean treebank).☆19May 6, 2026Updated last month
- KoRean based BERT pre-trained models (KR-BERT) for Tensorflow and PyTorch☆211Apr 24, 2024Updated 2 years ago
- Pretrained ELECTRA Model for Korean☆635Feb 19, 2024Updated 2 years ago
- Finetuning Pipeline☆89Feb 25, 2022Updated 4 years ago