Ko-Arena-Hard-Auto: An automatic LLM benchmark for Korean
☆22Apr 23, 2025Updated last year
Alternatives and similar repositories for ko-arena-hard-auto
Users that are interested in ko-arena-hard-auto are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple JSON parser specifically designed to handle malformed JSON output from Large Language Models (LLMs) like GPT, Claude, and others…☆27Jun 20, 2025Updated 11 months ago
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 9 months ago
- 한글 텍스트 임베딩 모델 리더보드☆96Oct 22, 2024Updated last year
- BERT score for text generation☆12Jan 15, 2025Updated last year
- [arXiv 2026] Official PyTorch Repository for "Coarse-Guided Visual Generation via Weighted h-Transform Sampling"☆42May 8, 2026Updated 3 weeks ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- This repository aims to develop CoT Steering based on CoT without Prompting. It focuses on enhancing the model’s latent reasoning capabil…☆115Jun 25, 2025Updated 11 months ago
- Gunmo-emo-classification: 한국어 감정 다중 분류 모델 제작법☆28Dec 12, 2023Updated 2 years ago
- ☆64Jul 21, 2025Updated 10 months ago
- nanoRLHF: from-scratch journey into how LLMs and RLHF really work.☆182Jan 23, 2026Updated 4 months ago
- ☆13Jan 31, 2025Updated last year
- Korean Translation Benchmark, LLM-as-a-judge☆22Oct 23, 2025Updated 7 months ago
- 한국어 언어모델 다분야 사고력 벤치마크☆209Oct 17, 2024Updated last year
- huggingface에 있는 한국어 데이터 세트☆36Oct 10, 2024Updated last year
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆12Mar 25, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆29Nov 10, 2024Updated last year
- 랭체인 & 랭그래프로 AI 에이전트 개발하기 소스 코드☆12Mar 3, 2025Updated last year
- Korean Sentence Embedding Model Performance Benchmark for RAG☆50Jan 27, 2025Updated last year
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- A comprehensive mcp server to post on x.com, with oauth v1 and v2, as well as v1.1 and v2 x API implementation☆20Jun 27, 2025Updated 11 months ago
- Miscellaneous codes and writings for MLOps☆15Apr 8, 2026Updated last month
- ☆11Aug 13, 2023Updated 2 years ago
- Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)☆48Mar 2, 2024Updated 2 years ago
- 《GPT-4, ChatGPT, 라마인덱스, 랭체인을 활용한 인공지능 프로그래밍》 예제 코드☆10Jan 16, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆36Oct 4, 2023Updated 2 years ago
- Korean Benchmark for Korean Legal Language Understanding☆19Nov 16, 2024Updated last year
- This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervisied SimCSE, and dist…☆22Oct 6, 2023Updated 2 years ago
- Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆19Jun 11, 2025Updated 11 months ago
- hwplib 패키지 python에서 쉽게 사용 할수 있게 만든 github repo 입니다.☆54Mar 29, 2025Updated last year
- This is a simple node for comfyUI that accesses any openAI API server the user specifies and enables simple text generation with a string…☆29Jun 14, 2024Updated last year
- 1-Click is all you need.☆63Apr 29, 2024Updated 2 years ago
- 인공지능에 대한 배경지식이 없어도 LLM을 학습시켜 나만의 GPT를 만들 수 있는 오픈소스 솔루션 | 🏆 2023 공개SW 개발자대회 장려상☆11Nov 9, 2023Updated 2 years ago
- A curated and categorized paper list of gnn-based complex graph learning.☆11Apr 9, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The source code of the game I made for the HuggingFace game jam☆16Jul 25, 2023Updated 2 years ago
- ☆20Jul 24, 2024Updated last year
- ☆13Mar 23, 2023Updated 3 years ago
- MurderMystery game for Nukkit☆15Feb 9, 2024Updated 2 years ago
- ☆12Dec 20, 2024Updated last year
- This repository contains the code for the paper: Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language Models☆21Apr 27, 2024Updated 2 years ago
- 자체 구축한 한국어 평가 데이터셋을 이용한 한국어 모델 평가☆31May 31, 2024Updated last year