Difference-based Contrastive Learning for Korean Sentence Embeddings
☆23Mar 11, 2026Updated last week
Alternatives and similar repositories for KoDiffCSE
Users that are interested in KoDiffCSE are comparing it to the libraries listed below
Sorting:
- ☆36Oct 4, 2023Updated 2 years ago
- Korean Sentence Embedding Repository☆210Dec 1, 2024Updated last year
- Korean Light Weight Language Model☆31May 26, 2023Updated 2 years ago
- 🧀 KoBART summarization using pytorch☆13Jun 7, 2023Updated 2 years ago
- SQuAD Question Generation module based on T5-large☆17Aug 26, 2022Updated 3 years ago
- Troll Detector☆15Nov 28, 2022Updated 3 years ago
- bpe based korean t5 model for text-to-text unified framework☆63Apr 17, 2024Updated last year
- Simple Contrastive Learning of Korean Sentence Embeddings☆53Dec 9, 2022Updated 3 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆28Dec 9, 2022Updated 3 years ago
- BERT score for text generation☆12Jan 15, 2025Updated last year
- 최신 자연어처리 모델 소개☆74Jul 22, 2022Updated 3 years ago
- [HCLT 2022] Korean sentence text similarity dataset using naver shopping review☆25Oct 20, 2022Updated 3 years ago
- Beyond LM: How can language model go forward in the future?☆15Apr 30, 2023Updated 2 years ago
- #인권코퍼스☆31Oct 6, 2023Updated 2 years ago
- KorQuAD Korean domain Question Generation module based on KoBART☆29Nov 9, 2022Updated 3 years ago
- ☆31Nov 23, 2022Updated 3 years ago
- KOLD: Korean Offensive Language Dataset☆82Nov 13, 2022Updated 3 years ago
- 거꾸로 읽는 self-supervised learning in NLP☆27Oct 30, 2022Updated 3 years ago
- ☆197May 22, 2023Updated 2 years ago
- Code and data for "KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark" (LREC-COLING…☆17Apr 15, 2025Updated 11 months ago
- Korean Named Entity Corpus☆25May 12, 2023Updated 2 years ago
- 한국어 생성 모델의 상식 추론을 위한 KommonGen 데이터셋입니다.☆21Oct 5, 2021Updated 4 years ago
- ☆11Jul 5, 2020Updated 5 years ago
- A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- ☆23Oct 30, 2023Updated 2 years ago
- ☆10Jun 9, 2021Updated 4 years ago
- Sentence Embeddings using Siamese ETRI KoBERT☆163Aug 16, 2025Updated 7 months ago
- ☆11Mar 10, 2023Updated 3 years ago
- Korean-English Bilingual Electra Models☆110Nov 22, 2021Updated 4 years ago
- Finetuning Pipeline☆89Feb 25, 2022Updated 4 years ago
- Benchmark in Korean Context☆138Sep 26, 2023Updated 2 years ago
- Source codes and dataset of Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge☆62Aug 4, 2023Updated 2 years ago
- Korean Commonsense Knowledge Graph☆15Dec 23, 2022Updated 3 years ago
- Korean Nested Named Entity Corpus☆20May 13, 2023Updated 2 years ago
- CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean☆48Dec 23, 2024Updated last year
- ☆10Jul 28, 2022Updated 3 years ago
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…☆10May 9, 2024Updated last year
- Convenient Text-to-Text Training for Transformers☆19Dec 10, 2021Updated 4 years ago
- ☆106May 8, 2023Updated 2 years ago