Ko-Arena-Hard-Auto: An automatic LLM benchmark for Korean
☆22Apr 23, 2025Updated last year
Alternatives and similar repositories for ko-arena-hard-auto
Users that are interested in ko-arena-hard-auto are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple JSON parser specifically designed to handle malformed JSON output from Large Language Models (LLMs) like GPT, Claude, and others…☆27Jun 20, 2025Updated 11 months ago
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 10 months ago
- 한글 텍스트 임베딩 모델 리더보드☆96Oct 22, 2024Updated last year
- BERT score for text generation☆12Jan 15, 2025Updated last year
- This repository aims to develop CoT Steering based on CoT without Prompting. It focuses on enhancing the model’s latent reasoning capabil…☆116Jun 25, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Gunmo-emo-classification: 한국어 감정 다중 분류 모델 제작법☆28Dec 12, 2023Updated 2 years ago
- ☆64Jul 21, 2025Updated 10 months ago
- nanoRLHF: from-scratch journey into how LLMs and RLHF really work.☆190Jan 23, 2026Updated 4 months ago
- ☆14Jan 31, 2025Updated last year
- Korean Translation Benchmark, LLM-as-a-judge☆22Oct 23, 2025Updated 7 months ago
- ☆15Sep 6, 2024Updated last year
- 한국어 언어모델 다분야 사고력 벤치마크☆209Oct 17, 2024Updated last year
- ☆10Dec 19, 2023Updated 2 years ago
- A lightweight adjustment tool for smoothing token probabilities in the Qwen models to encourage balanced multilingual generation.☆106Jul 9, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆38Feb 6, 2026Updated 4 months ago
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆12Mar 25, 2025Updated last year
- ✨ NovelAI api python sdk, easy to use, modern and user-friendly.☆48Dec 2, 2025Updated 6 months ago
- prototype of plant-disease-detector☆10Apr 21, 2021Updated 5 years ago
- 랭체인 & 랭그래프로 AI 에이전트 개발하기 소스 코드☆12Mar 3, 2025Updated last year
- Korean Sentence Embedding Model Performance Benchmark for RAG☆50Jan 27, 2025Updated last year
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)☆49Mar 2, 2024Updated 2 years ago
- 《GPT-4, ChatGPT, 라마인덱스, 랭체인을 활용한 인공지능 프로그래밍》 예제 코드☆10Jan 16, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆36Oct 4, 2023Updated 2 years ago
- Korean Benchmark for Korean Legal Language Understanding☆19Nov 16, 2024Updated last year
- This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervisied SimCSE, and dist…☆22Oct 6, 2023Updated 2 years ago
- ☆123Apr 21, 2023Updated 3 years ago
- Parses, Analyzes and Predicts for the Korean Baseball League☆17Dec 8, 2022Updated 3 years ago
- Pretraining and finetuning for visual instruction following with Mixture of Experts☆15Jan 30, 2024Updated 2 years ago
- Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆20Jun 11, 2025Updated last year
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- hwplib 패키지 python에서 쉽게 사용 할수 있게 만든 github repo 입니다.☆54Mar 29, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 1-Click is all you need.☆63Apr 29, 2024Updated 2 years ago
- Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc☆81Feb 28, 2024Updated 2 years ago
- 인공지능에 대한 배경지식이 없어도 LLM을 학습시켜 나만의 GPT를 만들 수 있는 오픈소스 솔루션 | 🏆 2023 공개SW 개발자대회 장려상☆11Nov 9, 2023Updated 2 years ago
- The source code of the game I made for the HuggingFace game jam☆16Jul 25, 2023Updated 2 years ago
- ☆20Jul 24, 2024Updated last year
- ☆13Mar 23, 2023Updated 3 years ago
- Using machine learning to diagnose foliar diseases in apple plants☆10Jun 14, 2021Updated 5 years ago