LG-AI-EXAONE / KoMT-Bench

Official repository for KoMT-Bench built by LG AI Research

☆71

Alternatives and similar repositories for KoMT-Bench

Users who are interested in KoMT-Bench compare it with the repositories listed below.

songys / huggingface_KoreanDataset
Korean datasets available on Hugging Face
☆35 Updated last year
HAE-RAE / haerae-evaluation-toolkit
The most modern LLM evaluation toolkit
☆70 Updated 2 months ago
paust-team / pko-t5
BPE-based Korean T5 model for a unified text-to-text framework
☆63 Updated last year
MrBananaHuman / PangyoCorpora
☆36 Updated 2 years ago
davidkim205 / kollm_evaluation
Korean model evaluation using self-built Korean evaluation datasets
☆31 Updated last year
J-Seo / KoCommonGEN-V2
KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models
☆25 Updated last year
kakao / FunctionChat-Bench
☆114 Updated 5 months ago
Marker-Inc-Korea / KO-Platypus
[KO-Platy🥮] KO-platypus model, fine-tuned from llama-2-ko using Korean-Open-platypus
☆74 Updated 4 months ago
rladmstn1714 / CLIcK
CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean
☆47 Updated last year
HeegyuKim / ko-rm-judge
Evaluating language model responses using a reward model
☆29 Updated last year
Beomi / ko-lm-evaluation-harness
Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc
☆81 Updated last year
gyunggyung / MLLMArxivTalk
[Google Meet] MLLM Arxiv Casual Talk
☆52 Updated 2 years ago
Beomi / Gemma-EasyLM
Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)
☆48 Updated last year
metterian / korean_bert_score
BERT score for text generation
☆12 Updated 11 months ago
workdd / LLM_Foreign_Block
Code implementation that blocks LLMs from generating foreign-language tokens
☆82 Updated 5 months ago
wisenut-research / KoT5
Korean T5 model
☆55 Updated 4 years ago
HAE-RAE / HAE-RAE-BENCH
Benchmark in Korean Context
☆135 Updated 2 years ago
daekeun-ml / evaluate-llm-on-korean-dataset
Performs benchmarking on two Korean datasets with minimal time and effort.
☆44 Updated 2 weeks ago
wandb / llm-kr-eval
☆20 Updated last year
Marker-Inc-Korea / AutoRAG-example-korean-embedding-benchmark
AutoRAG example about benchmarking Korean embeddings.
☆42 Updated last year
JoJo0217 / rlhf_korean_dataset
For a Korean-language RLHF learning environment
☆25 Updated 2 years ago
daje0601 / CoT-Reasoning_without_Prompting
A repository that implements, in code, Chain-of-Thought Reasoning without Prompting, published by Google.
☆66 Updated last year
sionic-ai / Data_KoSuperNI
Translation of the StrategyQA dataset
☆23 Updated last year
Beomi / easy-lm-trainer
🤗 Sample code for training an LM with minimal setup
☆58 Updated 2 years ago
EleutherAI / hae-rae
☆33 Updated 2 years ago
krafton-ai / KORani
☆107 Updated 2 years ago
monoclear-ai / monoclear.ai
Korean LLM leaderboard and model performance/safety management
☆22 Updated 2 years ago
DopeorNope-Lee / Ko-Fine-tuning_DataGen
☆69 Updated last year
42dot / 42dot_LLM
42dot LLM consists of a pre-trained language model, 42dot LLM-PLM, and a fine-tuned model, 42dot LLM-SFT, which is trained to respond to …
☆130 Updated last year
smilegate-ai / OPELA
☆30 Updated 3 years ago