metterian / korean_bert_scoreView external linksLinks
BERT score for text generation
☆12Jan 15, 2025Updated last year
Alternatives and similar repositories for korean_bert_score
Users that are interested in korean_bert_score are comparing it to the libraries listed below
Sorting:
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- Beyond LM: How can language model go forward in the future?☆15Apr 30, 2023Updated 2 years ago
- Difference-based Contrastive Learning for Korean Sentence Embeddings☆23Updated this week
- ☆23Aug 30, 2024Updated last year
- This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervisied SimCSE, and dist…☆22Oct 6, 2023Updated 2 years ago
- 거꾸로 읽는 self-supervised learning in NLP☆27Oct 30, 2022Updated 3 years ago
- BERTScore for Korean☆80Feb 22, 2024Updated last year
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated last year
- 자체 구축한 한국어 평가 데이터셋을 이용한 한국어 모델 평가☆31May 31, 2024Updated last year
- ☆36Oct 4, 2023Updated 2 years ago
- ☆19Apr 22, 2022Updated 3 years ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Jun 3, 2024Updated last year
- AutoRAG example about benchmarking Korean embeddings.☆43Oct 2, 2024Updated last year
- Evaluate gpt-4o on CLIcK (Korean NLP Dataset)☆20May 18, 2024Updated last year
- KURE: 고려대학교에서 개발한, 한국어 검색에 특화된 임베딩 모델☆206Sep 10, 2025Updated 5 months ago
- ☆20Jul 24, 2024Updated last year
- Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc☆82Feb 28, 2024Updated last year
- CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean☆47Dec 23, 2024Updated last year
- 한국어 언어모델 다분야 사고력 벤치마크☆201Oct 17, 2024Updated last year
- hwplib 패키지 python에서 쉽게 사용 할수 있게 만든 github repo 입니다.☆55Mar 29, 2025Updated 10 months ago
- generate synthetic data for LLM fine-tuning in arbitrary situations within systematic way☆22Mar 18, 2024Updated last year
- StrategyQA 데이터 세트 번역☆23Apr 12, 2024Updated last year
- Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)☆48Mar 2, 2024Updated last year
- Korean Sentence Embedding Model Performance Benchmark for RAG☆50Jan 27, 2025Updated last year
- This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured …☆64Apr 29, 2025Updated 9 months ago
- Ko-Arena-Hard-Auto: An automatic LLM benchmark for Korean☆22Apr 23, 2025Updated 9 months ago
- A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆36Aug 27, 2025Updated 5 months ago
- Curation note of NLP datasets☆98Dec 6, 2022Updated 3 years ago
- ☆63Dec 29, 2025Updated last month
- 1-Click is all you need.☆63Apr 29, 2024Updated last year
- Reward Model을 이용하여 언어모델의 답변을 평가하기☆29Feb 23, 2024Updated last year
- [Google Meet] MLLM Arxiv Casual Talk☆52Mar 16, 2023Updated 2 years ago
- Repository for the "Understanding and Mitigating Language Confusion in LLMs" paper☆29Jun 28, 2024Updated last year
- The most modern LLM evaluation toolkit☆70Nov 9, 2025Updated 3 months ago
- Benchmark in Korean Context☆136Sep 26, 2023Updated 2 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆28Dec 9, 2022Updated 3 years ago
- Official repository for KoMT-Bench built by LG AI Research☆71Aug 8, 2024Updated last year
- hwpxlib 패키지 python에서 쉽게 사용 할수 있게 만든 github repo 입니다.☆36Mar 29, 2025Updated 10 months ago
- Weak Labeling (NER) using ChatGPT☆37Mar 28, 2023Updated 2 years ago