HAE-RAE / haerae-evaluation-toolkit

The most modern LLM evaluation toolkit

☆70

Alternatives and similar repositories for haerae-evaluation-toolkit

Users who are interested in haerae-evaluation-toolkit compare it to the libraries listed below.

HAE-RAE / HAE-RAE-BENCH
Benchmark in Korean Context
☆136 Updated 2 years ago
kakao / FunctionChat-Bench
☆112 Updated 4 months ago
Marker-Inc-Korea / KO-Platypus
[KO-Platy🥮] KO-platypus model: llama-2-ko fine-tuned on Korean-Open-platypus
☆75 Updated 3 months ago
Beomi / ko-lm-evaluation-harness
Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc
☆81 Updated last year
LG-AI-EXAONE / KoMT-Bench
Official repository for KoMT-Bench built by LG AI Research
☆70 Updated last year
DopeorNope-Lee / Ko-Fine-tuning_DataGen
☆69 Updated last year
songys / huggingface_KoreanDataset
Korean datasets available on Hugging Face
☆33 Updated last year
Marker-Inc-Korea / K-G-OAT
Korean LLM fine-tuned from KoAlpaca using the IA3 method
☆69 Updated 2 years ago
liner-engineering / llm-meetup
Liner LLM Meetup archive
☆71 Updated last year
workdd / LLM_Foreign_Block
Code implementation that blocks an LLM from generating foreign-language tokens
☆81 Updated 4 months ago
instructkr / LogicKor
Multi-domain reasoning benchmark for Korean language models
☆199 Updated last year
nlpai-lab / KURE
KURE: an embedding model specialized for Korean retrieval, developed at Korea University
☆195 Updated 3 months ago
naver-ai / korean-safety-benchmarks
Official datasets and pytorch implementation repository of SQuARe and KoSBi (ACL 2023)
☆248 Updated 2 years ago
42dot / 42dot_LLM
42dot LLM consists of a pre-trained language model, 42dot LLM-PLM, and a fine-tuned model, 42dot LLM-SFT, which is trained to respond to …
☆130 Updated last year
daje0601 / CoT-Reasoning_without_Prompting
A repository that implements Google's Chain-of-Thought Reasoning without Prompting in code.
☆67 Updated last year
SKT-AI / A.X-4.0
SKT A.X LLM 4.0
☆144 Updated 4 months ago
Atipico1 / Kor-IR
Kor-IR: Korean Information Retrieval Benchmark
☆88 Updated last year
ssisOneTeam / Korean-Embedding-Model-Performance-Benchmark-for-Retriever
Korean Sentence Embedding Model Performance Benchmark for RAG
☆49 Updated 10 months ago
Marker-Inc-Korea / AutoRAG-example-korean-embedding-benchmark
AutoRAG example about benchmarking Korean embeddings.
☆41 Updated last year
paust-team / pko-t5
BPE-based Korean T5 model for a text-to-text unified framework
☆63 Updated last year
Marker-Inc-Korea / COT_steering
This repository aims to develop CoT Steering based on CoT without Prompting. It focuses on enhancing the model’s latent reasoning capabil…
☆114 Updated 5 months ago
davidkim205 / kollm_evaluation
Korean model evaluation using an in-house Korean evaluation dataset
☆31 Updated last year
jwj7140 / ko-medical-chat
Chatbot project specialized for the Korean medical domain
☆32 Updated 2 years ago
hyunwoongko / nlp-datasets
Curation note of NLP datasets
☆99 Updated 3 years ago
MrBananaHuman / CounselGPT
Korean psychological counseling dataset
☆80 Updated 2 years ago
chanmuzi / Papers
Paper list and short/long summaries I've read for my research or interests
☆22 Updated last year
gyunggyung / MLLMArxivTalk
[Google Meet] MLLM Arxiv Casual Talk
☆52 Updated 2 years ago
HeegyuKim / ko-rm-judge
Evaluating language model responses using a reward model
☆29 Updated last year
teamreboott / data-modori
☆40 Updated last year
boostcampaitech7 / level2-nlp-generationfornlp-nlp-05-lv3
level2-nlp-generationfornlp-nlp-05-lv3 created by GitHub Classroom
☆14 Updated 11 months ago