ko-nlp / moducorpus-sanitizer
Provides functionality for converting 모두의 말뭉치 (Modu Corpus) data into a form that is convenient for analysis.
☆11 · Mar 2, 2022 · Updated 3 years ago
Alternatives and similar repositories for moducorpus-sanitizer
Users who are interested in moducorpus-sanitizer are comparing it to the libraries listed below.
- Korean large emotion labeled dataset (EmoNSMC) ☆14 · Mar 5, 2020 · Updated 5 years ago
- 🦛 Python library for Korean text processing. Python Korean Morphological Analyzer ☆19 · Feb 4, 2025 · Updated last year
- ☆11 · Oct 3, 2021 · Updated 4 years ago
- baikal.ai's pre-trained BERT models: descriptions and sample code ☆12 · Jun 24, 2021 · Updated 4 years ago
- MeCab model trained with OpenKorPos. ☆23 · Jun 19, 2022 · Updated 3 years ago
- Crawlers for Namuwiki, Wikipedia, Daum Blog, Tistory, YouTube, and Nate Pann ☆12 · Jul 21, 2021 · Updated 4 years ago
- Convert Numerical Representations to Korean Pronunciation ☆14 · Apr 20, 2020 · Updated 5 years ago
- Korean Named Entity Corpus ☆25 · May 12, 2023 · Updated 2 years ago
- Beyond LM: How can language models go forward in the future? ☆15 · Apr 30, 2023 · Updated 2 years ago
- A utility for storing and reading files for Korean LM training 💾 ☆35 · Oct 15, 2025 · Updated 4 months ago
- ☆21 · May 24, 2023 · Updated 2 years ago
- Korean Nested Named Entity Corpus ☆20 · May 13, 2023 · Updated 2 years ago
- A project for Korean automatic spacing ☆12 · Aug 3, 2020 · Updated 5 years ago
- Profanity detection model ☆16 · Dec 19, 2019 · Updated 6 years ago
- Korean lexical semantic analysis model ☆21 · Apr 4, 2022 · Updated 3 years ago
- ☆19 · Jan 29, 2023 · Updated 3 years ago
- ☆19 · Sep 20, 2022 · Updated 3 years ago
- Machine Generated Captions for Best Artworks ☆22 · Sep 21, 2022 · Updated 3 years ago
- Korean Movie Review Emotion (KMRE) Dataset ☆21 · Sep 7, 2020 · Updated 5 years ago
- Google's Conceptual Captions Dataset translated into Korean ☆23 · Aug 28, 2022 · Updated 3 years ago
- Performing downstream tasks using huggingface ☆62 · Dec 28, 2021 · Updated 4 years ago
- Korean Parallel Corpus ☆11 · Nov 27, 2014 · Updated 11 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation ☆11 · May 27, 2022 · Updated 3 years ago
- Wikitext-format dataset of Namuwiki (the most famous Korean wiki) ☆53 · Oct 25, 2020 · Updated 5 years ago
- 🦕 A library that handles everything with 🤗 and supports batching to models in PORORO ☆37 · Jun 16, 2022 · Updated 3 years ago
- KSenticNet: Korean sentiment lexicon ☆33 · May 20, 2019 · Updated 6 years ago
- An example of fine-tuning kogpt with oslo. ☆23 · Aug 26, 2022 · Updated 3 years ago
- Korean speech recognition with Quartznet, a quantized model based on Jasper ☆22 · Jul 21, 2021 · Updated 4 years ago
- Similar string search in Levenshtein distance ☆21 · Jun 19, 2021 · Updated 4 years ago
- Phoneme recognition using various features and deep learning. ☆14 · Nov 27, 2019 · Updated 6 years ago
- The official Python client library for deeq NLP, a new Korean NLP with deep learning. ☆21 · Aug 2, 2022 · Updated 3 years ago
- 이동호, 이정훈, 김유리, 김형준, 박승면, 양유준, 신웅비 (Dong Ho Lee, Jung Hoon Lee, Yu Ri Kim, Hyung Jun Kim, Seung Myun Park, Yu Jun Yang, Woong Bi Shin) ☆15 · Apr 16, 2020 · Updated 5 years ago
- "Why do I feel offended?" - Korean Dataset for Offensive Language Identification (EACL2023 Findings) ☆15 · May 14, 2023 · Updated 2 years ago
- KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch ☆15 · Feb 13, 2022 · Updated 4 years ago
- KOLD: Korean Offensive Language Dataset ☆81 · Nov 13, 2022 · Updated 3 years ago
- Korean Math Word Problems ☆59 · Jan 14, 2022 · Updated 4 years ago
- Yet another python binding for mecab-ko ☆88 · May 16, 2023 · Updated 2 years ago
- Abstractive summarization using Bert2Bert framework. ☆31 · Dec 5, 2020 · Updated 5 years ago
- Korean BERT model using character tokenizer ☆27 · Apr 8, 2021 · Updated 4 years ago