Ko-Arena-Hard-Auto: An automatic LLM benchmark for Korean
☆22 · Apr 23, 2025 · Updated 11 months ago
Alternatives and similar repositories for ko-arena-hard-auto
Users interested in ko-arena-hard-auto are comparing it to the repositories listed below.
- Instruct.KR 2025 Summer Meetup: Open-source LLMs, all the way to production with vLLM ☆23 · Aug 2, 2025 · Updated 7 months ago
- Korean text embedding model leaderboard ☆93 · Oct 22, 2024 · Updated last year
- BERT score for text generation ☆12 · Jan 15, 2025 · Updated last year
- This repository aims to develop CoT Steering based on CoT without Prompting. It focuses on enhancing the model's latent reasoning capabil… ☆115 · Jun 25, 2025 · Updated 9 months ago
- Gunmo-emo-classification: how to build a Korean multi-class emotion classification model ☆27 · Dec 12, 2023 · Updated 2 years ago
- ☆64 · Jul 21, 2025 · Updated 8 months ago
- A comfyui typescript client for the bun runtime ☆16 · Nov 24, 2025 · Updated 4 months ago
- nanoRLHF: from-scratch journey into how LLMs and RLHF really work. ☆174 · Jan 23, 2026 · Updated 2 months ago
- Korean Translation Benchmark, LLM-as-a-judge ☆22 · Oct 23, 2025 · Updated 5 months ago
- Multi-domain reasoning benchmark for Korean language models ☆201 · Oct 17, 2024 · Updated last year
- The most modern LLM evaluation toolkit ☆70 · Nov 9, 2025 · Updated 4 months ago
- ☆10 · Dec 19, 2023 · Updated 2 years ago
- Korean datasets available on Hugging Face ☆36 · Oct 10, 2024 · Updated last year
- ☆29 · Nov 10, 2024 · Updated last year
- A lightweight adjustment tool for smoothing token probabilities in the Qwen models to encourage balanced multilingual generation. ☆104 · Jul 9, 2025 · Updated 8 months ago
- Homebrew MCP: Comprehensive brew support for installing, upgrading, searching, and maintaining macOS packages. ☆25 · Jun 23, 2025 · Updated 9 months ago
- Source code for "Developing AI Agents with LangChain & LangGraph" ☆12 · Mar 3, 2025 · Updated last year
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models ☆25 · Aug 24, 2024 · Updated last year
- Korean Sentence Embedding Model Performance Benchmark for RAG ☆50 · Jan 27, 2025 · Updated last year
- Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series) ☆48 · Mar 2, 2024 · Updated 2 years ago
- Example code for the book "AI Programming with GPT-4, ChatGPT, LlamaIndex, and LangChain" ☆10 · Jan 16, 2024 · Updated 2 years ago
- ☆32 · Jul 20, 2024 · Updated last year
- ☆36 · Oct 4, 2023 · Updated 2 years ago
- Korean Benchmark for Korean Legal Language Understanding ☆18 · Nov 16, 2024 · Updated last year
- This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervised SimCSE, and dist… ☆22 · Oct 6, 2023 · Updated 2 years ago
- ☆123 · Apr 21, 2023 · Updated 2 years ago
- Pretraining and finetuning for visual instruction following with Mixture of Experts ☆16 · Jan 30, 2024 · Updated 2 years ago
- Parses, Analyzes and Predicts for the Korean Baseball League ☆17 · Dec 8, 2022 · Updated 3 years ago
- Codebase for the paper "The Remarkable Robustness of LLMs: Stages of Inference?" ☆19 · Jun 11, 2025 · Updated 9 months ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS (iDUS) implementation and experiment details. ☆14 · Mar 20, 2024 · Updated 2 years ago
- A GitHub repo that makes the hwplib package easy to use from Python. ☆54 · Mar 29, 2025 · Updated last year
- 1-Click is all you need. ☆63 · Apr 29, 2024 · Updated last year
- Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc ☆82 · Feb 28, 2024 · Updated 2 years ago
- A curated and categorized paper list of gnn-based complex graph learning. ☆11 · Apr 9, 2023 · Updated 2 years ago
- The source code of the game I made for the HuggingFace game jam ☆16 · Jul 25, 2023 · Updated 2 years ago
- ☆23 · Feb 11, 2026 · Updated last month
- ☆20 · Jul 24, 2024 · Updated last year
- MurderMystery game for Nukkit ☆14 · Feb 9, 2024 · Updated 2 years ago
- Using machine learning to diagnose foliar diseases in apple plants ☆10 · Jun 14, 2021 · Updated 4 years ago