QuoQA-NLP/Ko-conceptual-captions

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/QuoQA-NLP/Ko-conceptual-captions)

QuoQA-NLP / Ko-conceptual-captions

Google's Conceptual Captions Dataset translated into Korean

☆23

Alternatives and similar repositories for Ko-conceptual-captions

Users that are interested in Ko-conceptual-captions are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

passing2961 / EmoNSMC
View on GitHub
Korean large emotion labeled dataset (EmoNSMC)
☆14Mar 5, 2020Updated 6 years ago
korean-named-entity / konec
View on GitHub
Korean Named Entity Corpus
☆25May 12, 2023Updated 3 years ago
korean-named-entity / konne-prep
View on GitHub
☆19Jan 29, 2023Updated 3 years ago
tunib-ai / artwork_captions
View on GitHub
Machine Generated Captions for Best Artworks
☆22Sep 21, 2022Updated 3 years ago
ko-nlp / moducorpus-sanitizer
View on GitHub
모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.
☆11Mar 2, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
openkorpos / model-mecab
View on GitHub
MeCab model trained with OpenKorPos.
☆23Jun 19, 2022Updated 4 years ago
smilegate-ai / OPELA
View on GitHub
☆29Nov 23, 2022Updated 3 years ago
baikalai / baikal-bert
View on GitHub
baikal.ai's pre-trained BERT models: descriptions and sample codes
☆12Jun 24, 2021Updated 5 years ago
MrBananaHuman / PangyoCorpora
View on GitHub
☆38Oct 4, 2023Updated 2 years ago
nlpai-lab / Korean-CommonGen
View on GitHub
[Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation
☆11May 27, 2022Updated 4 years ago
jeongukjae / namuwiki-corpus
View on GitHub
문장단위로 분절된 나무위키 데이터셋. Releases에서 다운로드 받거나, tfds-korean을 통해 다운로드 받으세요.
☆19Jun 16, 2021Updated 5 years ago
bjpublic / TMI-Deeplearning
View on GitHub
친절한 실전 딥러닝 수업
☆12Sep 22, 2020Updated 5 years ago
YongWookHa / kor-text-preprocess
View on GitHub
Korean text data preprocess toolkit for NLP
☆18Jun 11, 2019Updated 7 years ago
songys / 2021Langcon
View on GitHub
☆11Oct 3, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
EleutherAI / hae-rae
View on GitHub
☆33Aug 30, 2023Updated 2 years ago
korean-named-entity / konne
View on GitHub
Korean Nested Named Entity Corpus
☆20May 13, 2023Updated 3 years ago
soyoung97 / Standard_Korean_GEC
View on GitHub
☆62Jan 2, 2024Updated 2 years ago
passing2961 / KMRE
View on GitHub
Korean Moview Review Emotion (KMRE) Dataset
☆21Sep 7, 2020Updated 5 years ago
jooinjang / Ko-ATOMIC
View on GitHub
Korean Commonsense Knowledge Graph
☆15Dec 23, 2022Updated 3 years ago
JoungheeKim / kor-spacing
View on GitHub
This is project for korean auto spacing
☆12Aug 3, 2020Updated 5 years ago
lovit / petitions_archive
View on GitHub
청와대 국민청원 데이터 아카이브
☆16Aug 29, 2020Updated 5 years ago
AIRC-KETI / Korean-Copora
View on GitHub
☆14Dec 9, 2021Updated 4 years ago
SKplanet / Dialog-KoELECTRA
View on GitHub
ELECTRA기반 한국어 대화체 언어모델
☆54Aug 4, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
upskyy / Automatic-Speech-Recognition-Models
View on GitHub
End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
☆10Jan 21, 2022Updated 4 years ago
HeegyuKim / korouge
View on GitHub
Google 공식 Rouge Implementation을 한국어에서 사용할 수 있도록 처리
☆17Jan 3, 2024Updated 2 years ago
ByungjunKim / DDMKL
View on GitHub
한국 현대문학 박사학위 논문 서지 데이터 분석
☆24Aug 17, 2024Updated last year
smothly / bad-word-detection
View on GitHub
비속어 탐지 모델
☆16Dec 19, 2019Updated 6 years ago
kipi-ai / korpatbert
View on GitHub
특허분야 특화된 한국어 AI언어모델 KorPatBERT
☆70Jan 31, 2024Updated 2 years ago
joeljang / FLM
View on GitHub
All-in-one repository for Fine-tuning & Pretraining (Large) Language Models
☆15Mar 8, 2023Updated 3 years ago
sb-jang / kodialogbench
View on GitHub
Code and data for "KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark" (LREC-COLING…
☆18Apr 15, 2025Updated last year
BitnaKeum / Web_Crawler
View on GitHub
나무위키, 위키피디아, 다음블로그, 티스토리, 유튜브, 네이트판 크롤러
☆13Feb 20, 2026Updated 5 months ago
monologg / ko_lm_dataformat
View on GitHub
A utility for storing and reading files for Korean LM training 💾
☆35Jul 18, 2026Updated last week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
nlpai-lab / KommonGen
View on GitHub
한국어 생성 모델의 상식 추론을 위한 KommonGen 데이터셋입니다.
☆21Oct 5, 2021Updated 4 years ago
nayohan / SimKoR
View on GitHub
[HCLT 2022] Korean sentence text similarity dataset using naver shopping review
☆25Oct 20, 2022Updated 3 years ago
boychaboy / KOLD
View on GitHub
KOLD: Korean Offensive Language Dataset
☆83Nov 13, 2022Updated 3 years ago
Data-Intelligence-Lab / DEFT-korean-alpaca
View on GitHub
☆23Oct 30, 2023Updated 2 years ago
jason9693 / FROZEN
View on GitHub
☆14May 3, 2022Updated 4 years ago
HeegyuKim / language-model
View on GitHub
한국어 언어 모델 학습을 위한 프로젝트(Flax, Pytorch with Huggingface Accelerate)
☆32Sep 13, 2023Updated 2 years ago
warnikchow / kosp2e
View on GitHub
Korean Speech to English Translation Corpus
☆45Sep 3, 2021Updated 4 years ago