qwopqwop200/ko-arena-hard-auto

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/qwopqwop200/ko-arena-hard-auto)

qwopqwop200 / ko-arena-hard-auto

Ko-Arena-Hard-Auto: An automatic LLM benchmark for Korean

☆22

Alternatives and similar repositories for ko-arena-hard-auto

Users that are interested in ko-arena-hard-auto are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aotakeda / ai-json-fixer
View on GitHub
A simple JSON parser specifically designed to handle malformed JSON output from Large Language Models (LLMs) like GPT, Claude, and others…
☆27Jun 20, 2025Updated last year
su-park / mteb_ko_leaderboard
View on GitHub
한글 텍스트 임베딩 모델 리더보드
☆97Oct 22, 2024Updated last year
metterian / korean_bert_score
View on GitHub
BERT score for text generation
☆12Jan 15, 2025Updated last year
Marker-Inc-Korea / COT_steering
View on GitHub
This repository aims to develop CoT Steering based on CoT without Prompting. It focuses on enhancing the model’s latent reasoning capabil…
☆116Jun 25, 2025Updated last year
jwj7140 / Gunmo-emo-classification
View on GitHub
Gunmo-emo-classification: 한국어 감정 다중 분류 모델 제작법
☆28Dec 12, 2023Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
sionic-ai / Llama4-Token-Editor
View on GitHub
☆64Jul 21, 2025Updated last year
felix01189 / SEED
View on GitHub
☆14Jan 31, 2025Updated last year
SharathChampzz / Leaf_Disease_Detection-Classification
View on GitHub
Flask App Which detects 15 variety of plants [Pepper , Potato , Tomato ]
☆11Aug 27, 2020Updated 5 years ago
hyunwoongko / nanoRLHF
View on GitHub
nanoRLHF: from-scratch journey into how LLMs and RLHF really work.
☆195Jul 13, 2026Updated 2 weeks ago
joungminsung / codex-discord-connector
View on GitHub
Run and manage local Codex workflows from trusted Discord channels.
☆19Apr 27, 2026Updated 3 months ago
HAE-RAE / haerae-evaluation-toolkit
View on GitHub
The most modern LLM evaluation toolkit
☆70Apr 30, 2026Updated 3 months ago
IVADL / tomato-disease-detector
View on GitHub
prototype of plant-disease-detector
☆10Apr 21, 2021Updated 5 years ago
Aloe-droid / Yolov8_Android
View on GitHub
☆10Dec 19, 2023Updated 2 years ago
deveworld / KorT
View on GitHub
Korean Translation Benchmark, LLM-as-a-judge
☆23Oct 23, 2025Updated 9 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
songys / huggingface_KoreanDataset
View on GitHub
huggingface에 있는 한국어 데이터 세트
☆37Oct 10, 2024Updated last year
dnotitia / smoothie-qwen
View on GitHub
A lightweight adjustment tool for smoothing token probabilities in the Qwen models to encourage balanced multilingual generation.
☆106Jul 9, 2025Updated last year
tsdata / langchain-ollama
View on GitHub
☆29Nov 10, 2024Updated last year
prometheus-eval / scaling-evaluation-compute
View on GitHub
Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"
☆12Mar 25, 2025Updated last year
Zerohertz / Instruct_KR_2025_Summer_Meetup_vLLM
View on GitHub
🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹
☆23Aug 2, 2025Updated 11 months ago
J-Seo / KoCommonGEN-V2
View on GitHub
KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models
☆25Aug 24, 2024Updated last year
wikibook / openai-llm
View on GitHub
《GPT-4, ChatGPT, 라마인덱스, 랭체인을 활용한 인공지능 프로그래밍》 예제 코드
☆10Jan 16, 2024Updated 2 years ago
OnAnd0n / ko-embedding-leaderboard
View on GitHub
Korean-MTEB
☆100May 12, 2026Updated 2 months ago
ssisOneTeam / Korean-Embedding-Model-Performance-Benchmark-for-Retriever
View on GitHub
Korean Sentence Embedding Model Performance Benchmark for RAG
☆49Jan 27, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
clprice32 / Predicting-NBA-Game-Winners
View on GitHub
Using decision tree and random forest models, predict the winner of an NBA regular season game
☆15Jun 7, 2018Updated 8 years ago
gilbutITbook / 080456
View on GitHub
랭체인 & 랭그래프로 AI 에이전트 개발하기 소스 코드
☆12Mar 3, 2025Updated last year
Beomi / Gemma-EasyLM
View on GitHub
Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)
☆50Mar 2, 2024Updated 2 years ago
MrBananaHuman / PangyoCorpora
View on GitHub
☆38Oct 4, 2023Updated 2 years ago
daekeun-ml / KoSimCSE-SageMaker
View on GitHub
This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervisied SimCSE, and dist…
☆22Oct 6, 2023Updated 2 years ago
overfit-brothers / KRX-2024
View on GitHub
☆12Dec 20, 2024Updated last year
Beomi / ko-lm-evaluation-harness
View on GitHub
Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc
☆81Feb 28, 2024Updated 2 years ago
gauss5930 / iDUS
View on GitHub
An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.
☆14Mar 20, 2024Updated 2 years ago
StableFluffy / EasyLLMFeaturePorter
View on GitHub
1-Click is all you need.
☆63Apr 29, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
wandb / llm-kr-eval
View on GitHub
☆20Jul 24, 2024Updated 2 years ago
choijhyeok / python-hwplib
View on GitHub
hwplib 패키지 python에서 쉽게 사용 할수 있게 만든 github repo 입니다.
☆54Mar 29, 2025Updated last year
dxlong2000 / FormatBiasEval
View on GitHub
Official codes for NAACL 2025 paper "LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias …
☆11Nov 25, 2025Updated 8 months ago
Delve-ERAV1 / Phi-2-Vision-Language
View on GitHub
Pretraining and finetuning for visual instruction following with Mixture of Experts
☆15Jan 30, 2024Updated 2 years ago
airmang / hwpx-plugins
View on GitHub
Official onboarding skill for HWPX document automation with AI agents.
☆19Updated this week
RGLie / AgentBlue
View on GitHub
☆40Mar 9, 2026Updated 4 months ago
Zio-94 / HieraPlan
View on GitHub
HieraPlan - Hierarchical Task Planner for llm agents
☆17Mar 20, 2025Updated last year