jeongukjae/namuwiki-corpus

A Namuwiki dataset segmented into sentences. Download it from Releases, or through tfds-korean.

☆19

Alternatives and similar repositories for namuwiki-corpus

Users who are interested in namuwiki-corpus are comparing it to the repositories listed below.

songys / single_turn_dialogue
Data consisting only of dialogue example sentences extracted from a dictionary
☆16 · Apr 24, 2023 · Updated 3 years ago
taeminlee / KoGPT2-Transformers
KoGPT2 on Huggingface Transformers
☆33 · May 4, 2021 · Updated 5 years ago
passing2961 / EmoNSMC
Large Korean emotion-labeled dataset (EmoNSMC)
☆14 · Mar 5, 2020 · Updated 6 years ago
lovit / huggingface_konlpy
Training Transformers of Huggingface with KoNLPy
☆68 · Aug 28, 2020 · Updated 5 years ago
smothly / bad-word-detection
Profanity detection model
☆16 · Dec 19, 2019 · Updated 6 years ago
Data-Intelligence-Lab / DEFT-korean-alpaca
☆23 · Oct 30, 2023 · Updated 2 years ago
Beomi / exbert-transformers
exBERT on Transformers🤗
☆10 · Jun 14, 2021 · Updated 5 years ago
EleutherAI / hae-rae
☆33 · Aug 30, 2023 · Updated 2 years ago
korean-named-entity / konne
Korean Nested Named Entity Corpus
☆20 · May 13, 2023 · Updated 3 years ago
nlpai-lab / Korean-CommonGen
[Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation
☆11 · May 27, 2022 · Updated 4 years ago
warnikchow / kosp2e
Korean Speech to English Translation Corpus
☆45 · Sep 3, 2021 · Updated 4 years ago
MrBananaHuman / KoreanCharacterBert
Korean BERT model using character tokenizer
☆27 · Apr 8, 2021 · Updated 5 years ago
ko-nlp / moducorpus-sanitizer
Provides functionality for converting Modu Corpus (모두의 말뭉치) data into a form convenient for analysis.
☆11 · Mar 2, 2022 · Updated 4 years ago
ModuNLP / hacking_transformers
☆11 · Aug 12, 2020 · Updated 5 years ago
AIRC-KETI / Korean-Copora
☆14 · Dec 9, 2021 · Updated 4 years ago
korean-named-entity / konec
Korean Named Entity Corpus
☆25 · May 12, 2023 · Updated 3 years ago
snunlp / KR-ELECTRA
KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch
☆15 · Feb 13, 2022 · Updated 4 years ago
choe-hyonsu-gabrielle / korean-amr-corpus
Korean Abstract Meaning Representation (AMR) Corpus
☆10 · Feb 27, 2022 · Updated 4 years ago
songys / Question_pair
Paired Question
☆24 · Jun 16, 2020 · Updated 6 years ago
QuoQA-NLP / Ko-conceptual-captions
Google's Conceptual Captions Dataset translated into Korean
☆23 · Aug 28, 2022 · Updated 3 years ago
Gyeongmin47 / KoCHET-A-Korean-Cultural-Heritage-corpus-for-Entity-related-Tasks
☆13 · Nov 30, 2022 · Updated 3 years ago
SKplanet / Dialog-KoELECTRA
ELECTRA-based language model for conversational Korean
☆54 · Aug 4, 2021 · Updated 4 years ago
jeongukjae / korean-wikipedia-corpus
A Korean Wikipedia corpus segmented into sentences. Download it from Releases or use it through tfds-korean.
☆24 · Sep 6, 2023 · Updated 2 years ago
baikalai / baikal-bert
baikal.ai's pre-trained BERT models: descriptions and sample code
☆12 · Jun 24, 2021 · Updated 5 years ago
lovit / KoBERTScore
BERTScore for Korean
☆81 · Feb 22, 2024 · Updated 2 years ago
minhoryang / KoNLPy-gRPC
Redesigned KoNLPy (Wrapper) for Usability and Portability with gRPC. [EXPERIMENTAL]
☆13 · Mar 7, 2023 · Updated 3 years ago
lih0905 / WSD_kor
Korean lexical semantic analysis model
☆25 · Apr 4, 2022 · Updated 4 years ago
passing2961 / KMRE
Korean Movie Review Emotion (KMRE) Dataset
☆21 · Sep 7, 2020 · Updated 5 years ago
jinmang2 / AdvancedTransformers
⛩ All about Korean Transformers (information and tutorial)
☆17 · Jun 21, 2022 · Updated 4 years ago
J-Seo / Korean-CommonGen
[Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation
☆28 · Dec 9, 2022 · Updated 3 years ago
YongWookHa / kor-text-preprocess
Korean text data preprocess toolkit for NLP
☆18 · Jun 11, 2019 · Updated 7 years ago
Huffon / nlp-various-tutorials
A repository of various tutorials related to natural language processing
☆80 · Jun 1, 2020 · Updated 6 years ago
smilegate-ai / HuLiC
☆93 · Mar 3, 2022 · Updated 4 years ago
jungyeul / korean-parallel-corpora
Korean Parallel Corpus
☆147 · Feb 24, 2024 · Updated 2 years ago
kakaobrain / kortok
The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)
☆119 · Oct 8, 2020 · Updated 5 years ago
sebkim / lda2vec-pytorch
lda2vec pytorch implementation
☆11 · Oct 18, 2019 · Updated 6 years ago
monologg / KoCharELECTRA
Character-level Korean ELECTRA Model (syllable-level Korean ELECTRA)
☆55 · Jun 12, 2023 · Updated 3 years ago
KPFBERT / kpfbertsum
☆15 · Nov 28, 2021 · Updated 4 years ago
reniew / NSMC_Sentimental-Analysis
Sentiment analysis of Korean text using Naver movie review data
☆12 · Aug 22, 2018 · Updated 7 years ago