corca-ai / evaluating-gpt-4o-on-CLIcKLinks

Evaluate gpt-4o on CLIcK (Korean NLP Dataset)

☆20

Alternatives and similar repositories for evaluating-gpt-4o-on-CLIcK

Users that are interested in evaluating-gpt-4o-on-CLIcK are comparing it to the libraries listed below

Sorting:

metterian / korean_bert_score
BERT score for text generation
☆12Updated 10 months ago
rladmstn1714 / CLIcK
CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean
☆47Updated 11 months ago
songys / huggingface_KoreanDataset
huggingface에 있는 한국어 데이터 세트
☆33Updated last year
J-Seo / KoCommonGEN-V2
KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models
☆25Updated last year
LG-AI-EXAONE / KoMT-Bench
Official repository for KoMT-Bench built by LG AI Research
☆70Updated last year
HAE-RAE / haerae-evaluation-toolkit
The most modern LLM evaluation toolkit
☆70Updated last month
sionic-ai / Data_KoSuperNI
StrategyQA 데이터 세트 번역
☆23Updated last year
HeegyuKim / ko-rm-judge
Reward Model을 이용하여 언어모델의 답변을 평가하기
☆29Updated last year
choijhyeok / python-hwpxlib
hwpxlib 패키지 python에서 쉽게 사용 할수 있게 만든 github repo 입니다.
☆35Updated 8 months ago
Marker-Inc-Korea / Korean-OpenOrca
OpenOrca-KO dataset을 활용하여 llama2를 fine-tuning한 Korean-OpenOrca
☆19Updated 2 years ago
hist0613 / arxivbot
☆60Updated 2 months ago
teamreboott / data-modori
☆40Updated last year
42dot / 42dot_LLM
42dot LLM consists of a pre-trained language model, 42dot LLM-PLM, and a fine-tuned model, 42dot LLM-SFT, which is trained to respond to …
☆130Updated last year
daje0601 / CoT-Reasoning_without_Prompting
구글에서 발표한 Chain-of-Thought Reasoning without Prompting을 코드로 구현한 레포입니다.
☆67Updated last year
sionic-ai / flasma
High-performance vector search engine with no loss of accuracy through GPU and dynamic placement
☆31Updated 4 months ago
Marker-Inc-Korea / KO-Platypus
[KO-Platy🥮] Korean-Open-platypus를 활용하여 llama-2-ko를 fine-tuning한 KO-platypus model
☆75Updated 3 months ago
MrBananaHuman / CounselGPT
한국어 심리 상담 데이터셋
☆80Updated 2 years ago
daekeun-ml / evaluate-llm-on-korean-dataset
Performs benchmarking on two Korean datasets with minimal time and effort.
☆45Updated this week
liner-engineering / llm-meetup
Liner LLM Meetup archive
☆71Updated last year
gyunggyung / MLLMArxivTalk
[Google Meet] MLLM Arxiv Casual Talk
☆52Updated 2 years ago
upskyy / kf-deberta-multitask
금융 도메인에 특화된 한국어 임베딩 모델
☆23Updated last year
MrBananaHuman / PangyoCorpora
☆36Updated 2 years ago
sionic-ai / Llama4-Token-Editor
☆64Updated 4 months ago
Marker-Inc-Korea / K-G-OAT
IA3방식으로 KoAlpaca를 fine tuning한 한국어 LLM모델
☆69Updated 2 years ago
choijhyeok / python-hwplib
hwplib 패키지 python에서 쉽게 사용 할수 있게 만든 github repo 입니다.
☆50Updated 8 months ago
paust-team / pko-t5
bpe based korean t5 model for text-to-text unified framework
☆63Updated last year
liner-engineering / liner-pdf-chat-tutorial
LINER PDF Chat Tutorial with ChatGPT & Pinecone
☆48Updated 2 years ago
krafton-ai / KORani
☆107Updated 2 years ago
monoclear-ai / monoclear.ai
한국어 LLM 리더보드 및 모델 성능/안전성 관리
☆22Updated 2 years ago
davidkim205 / kollm_evaluation
자체 구축한 한국어 평가 데이터셋을 이용한 한국어 모델 평가
☆31Updated last year