Ko-Arena-Hard-Auto: An automatic LLM benchmark for Korean
☆22 Apr 23, 2025 Updated last year
Alternatives and similar repositories for ko-arena-hard-auto
Users who are interested in ko-arena-hard-auto are comparing it to the repositories listed below.
- Instruct.KR 2025 Summer Meetup: Open-source LLMs, all the way to Production with vLLM ☆23 Aug 2, 2025 Updated 9 months ago
- Korean text embedding model leaderboard ☆95 Oct 22, 2024 Updated last year
- BERT score for text generation ☆12 Jan 15, 2025 Updated last year
- This repository aims to develop CoT Steering based on CoT without Prompting. It focuses on enhancing the model's latent reasoning capabil… ☆115 Jun 25, 2025 Updated 10 months ago
- Gunmo-emo-classification: how to build a Korean multi-class emotion classification model ☆27 Dec 12, 2023 Updated 2 years ago
- ☆64 Jul 21, 2025 Updated 9 months ago
- nanoRLHF: from-scratch journey into how LLMs and RLHF really work. ☆180 Jan 23, 2026 Updated 3 months ago
- ☆13 Jan 31, 2025 Updated last year
- Korean Translation Benchmark, LLM-as-a-judge ☆22 Oct 23, 2025 Updated 6 months ago
- The most modern LLM evaluation toolkit ☆69 Apr 30, 2026 Updated last week
- ☆10 Dec 19, 2023 Updated 2 years ago
- Korean datasets available on Hugging Face ☆36 Oct 10, 2024 Updated last year
- ☆29 Nov 10, 2024 Updated last year
- A lightweight adjustment tool for smoothing token probabilities in the Qwen models to encourage balanced multilingual generation. ☆105 Jul 9, 2025 Updated 10 months ago
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators" ☆12 Mar 25, 2025 Updated last year
- Prototype of plant-disease-detector ☆10 Apr 21, 2021 Updated 5 years ago
- Homebrew MCP: Comprehensive brew support for installing, upgrading, searching, and maintaining macOS packages. ☆28 Jun 23, 2025 Updated 10 months ago
- Source code for "Developing AI Agents with LangChain & LangGraph" ☆12 Mar 3, 2025 Updated last year
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models ☆25 Aug 24, 2024 Updated last year
- Korean Sentence Embedding Model Performance Benchmark for RAG ☆50 Jan 27, 2025 Updated last year
- Miscellaneous codes and writings for MLOps ☆15 Apr 8, 2026 Updated last month
- Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series) ☆48 Mar 2, 2024 Updated 2 years ago
- Example code for the book "AI Programming with GPT-4, ChatGPT, LlamaIndex, and LangChain" ☆10 Jan 16, 2024 Updated 2 years ago
- ☆36 Oct 4, 2023 Updated 2 years ago
- Korean Benchmark for Korean Legal Language Understanding ☆18 Nov 16, 2024 Updated last year
- This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervised SimCSE, and dist… ☆22 Oct 6, 2023 Updated 2 years ago
- ☆123 Apr 21, 2023 Updated 3 years ago
- Parses, analyzes, and predicts for the Korean Baseball League ☆17 Dec 8, 2022 Updated 3 years ago
- Codebase for the paper "The Remarkable Robustness of LLMs: Stages of Inference?" ☆19 Jun 11, 2025 Updated 10 months ago
- An unofficial implementation of the SOLAR-10.7B model and the newly proposed interlocked-DUS (iDUS) implementation and experiment details. ☆14 Mar 20, 2024 Updated 2 years ago
- A GitHub repo that makes the hwplib package easy to use from Python. ☆54 Mar 29, 2025 Updated last year
- 1-Click is all you need. ☆63 Apr 29, 2024 Updated 2 years ago
- Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc ☆81 Feb 28, 2024 Updated 2 years ago
- An open-source solution that lets you train an LLM and build your own GPT even without background knowledge in AI | 2023 Open Source Software Developer Contest, Encouragement Award ☆11 Nov 9, 2023 Updated 2 years ago
- A curated and categorized paper list of GNN-based complex graph learning. ☆11 Apr 9, 2023 Updated 3 years ago
- The source code of the game I made for the HuggingFace game jam ☆16 Jul 25, 2023 Updated 2 years ago
- ☆20 Jul 24, 2024 Updated last year
- ☆13 Mar 23, 2023 Updated 3 years ago
- SoftPool implementation (Refining activation downsampling with SoftPool). This is an unofficial implementation. https://arxiv.org/pdf/2101.… ☆15 Jan 20, 2021 Updated 5 years ago