MrBananaHuman / PangyoCorpora
☆36 · Oct 4, 2023 · Updated 2 years ago
Alternatives and similar repositories for PangyoCorpora
Users that are interested in PangyoCorpora are comparing it to the libraries listed below
- KommonGen dataset for commonsense reasoning in Korean generative models. ☆21 · Oct 5, 2021 · Updated 4 years ago
- ☆19 · Sep 20, 2022 · Updated 3 years ago
- MeCab model trained with OpenKorPos. ☆23 · Jun 19, 2022 · Updated 3 years ago
- A collection of open Korean instruction datasets for training language models. ☆19 · Jul 16, 2023 · Updated 2 years ago
- #인권코퍼스 (Human Rights Corpus) ☆31 · Oct 6, 2023 · Updated 2 years ago
- Korean Nested Named Entity Corpus ☆20 · May 13, 2023 · Updated 2 years ago
- Google's Conceptual Captions Dataset translated into Korean ☆23 · Aug 28, 2022 · Updated 3 years ago
- Difference-based Contrastive Learning for Korean Sentence Embeddings ☆23 · Updated this week
- Beyond LM: How can language model go forward in the future? ☆15 · Apr 30, 2023 · Updated 2 years ago
- Korean Named Entity Corpus ☆25 · May 12, 2023 · Updated 2 years ago
- KommonGen dataset for commonsense reasoning in Korean generative models. ☆17 · Oct 5, 2021 · Updated 4 years ago
- ☆23 · Oct 30, 2023 · Updated 2 years ago
- ☆21 · May 24, 2023 · Updated 2 years ago
- Provides utilities for converting Modu Corpus (모두의 말뭉치) data into a form convenient for analysis. ☆11 · Mar 2, 2022 · Updated 3 years ago
- ☆10 · Oct 28, 2024 · Updated last year
- Korean text data preprocess toolkit for NLP ☆18 · Jun 11, 2019 · Updated 6 years ago
- baikal.ai's pre-trained BERT models: descriptions and sample codes ☆12 · Jun 24, 2021 · Updated 4 years ago
- Kor-IR: Korean Information Retrieval Benchmark ☆87 · Jul 3, 2024 · Updated last year
- 🤗 Sample code for training an LM with minimal setup ☆59 · May 23, 2023 · Updated 2 years ago
- ☆19 · Oct 24, 2023 · Updated 2 years ago
- Translation of the StrategyQA dataset ☆23 · Apr 12, 2024 · Updated last year
- Machine Generated Captions for Best Artworks ☆22 · Sep 21, 2022 · Updated 3 years ago
- ☆33 · Aug 30, 2023 · Updated 2 years ago
- A collection of open Korean instruction datasets for training language models. ☆452 · Apr 13, 2025 · Updated 10 months ago
- ☆107 · May 8, 2023 · Updated 2 years ago
- Benchmark in Korean Context ☆136 · Sep 26, 2023 · Updated 2 years ago
- KURE: an embedding model specialized for Korean retrieval, developed at Korea University ☆206 · Sep 10, 2025 · Updated 5 months ago
- LocEmb: Location Embedding (Currently covering districts, roads, and businesses in Korea) ☆11 · Aug 15, 2022 · Updated 3 years ago
- Dataset of Korean Threatening Conversations ☆72 · Nov 1, 2022 · Updated 3 years ago
- Pecab: Pure Python Korean morpheme analyzer based on Mecab ☆172 · Apr 27, 2024 · Updated last year
- Hate speech detection corpus in Korean, shared with EMNLP 2023 paper ☆17 · Apr 19, 2024 · Updated last year
- Korean Commonsense Knowledge Graph ☆15 · Dec 23, 2022 · Updated 3 years ago
- BERT score for text generation ☆12 · Jan 15, 2025 · Updated last year
- ☆12 · Nov 30, 2022 · Updated 3 years ago
- AI model designed to test the effectiveness in handling external ethical attacks. ☆11 · Updated this week
- ☆10 · Jun 5, 2025 · Updated 8 months ago
- This is a project for Korean auto spacing ☆12 · Aug 3, 2020 · Updated 5 years ago
- ☆30 · Nov 23, 2022 · Updated 3 years ago
- ☆19 · Jan 29, 2023 · Updated 3 years ago