Ko-Arena-Hard-Auto: An automatic LLM benchmark for Korean
☆22Apr 23, 2025Updated 11 months ago
Alternatives and similar repositories for ko-arena-hard-auto
Users that are interested in ko-arena-hard-auto are comparing it to the libraries listed below.
- A simple JSON parser specifically designed to handle malformed JSON output from Large Language Models (LLMs) like GPT, Claude, and others…☆27Jun 20, 2025Updated 9 months ago
- 🎹 Instruct.KR 2025 Summer Meetup: Open-source LLMs, all the way to production with vLLM 🎹☆23Aug 2, 2025Updated 8 months ago
- Korean text embedding model leaderboard☆94Oct 22, 2024Updated last year
- BERT score for text generation☆12Jan 15, 2025Updated last year
- This repository aims to develop CoT Steering based on CoT without Prompting. It focuses on enhancing the model’s latent reasoning capabil…☆114Jun 25, 2025Updated 9 months ago
- Gunmo-emo-classification: how to build a Korean multi-class emotion classification model☆27Dec 12, 2023Updated 2 years ago
- ☆64Jul 21, 2025Updated 8 months ago
- nanoRLHF: from-scratch journey into how LLMs and RLHF really work.☆176Jan 23, 2026Updated 2 months ago
- ☆13Jan 31, 2025Updated last year
- Korean Translation Benchmark, LLM-as-a-judge☆22Oct 23, 2025Updated 5 months ago
- ☆15Sep 6, 2024Updated last year
- Multi-domain reasoning benchmark for Korean language models☆207Oct 17, 2024Updated last year
- The most modern LLM evaluation toolkit☆70Nov 9, 2025Updated 5 months ago
- Korean datasets available on Hugging Face☆36Oct 10, 2024Updated last year
- ☆29Nov 10, 2024Updated last year
- A lightweight adjustment tool for smoothing token probabilities in the Qwen models to encourage balanced multilingual generation.☆104Jul 9, 2025Updated 9 months ago
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆12Mar 25, 2025Updated last year
- prototype of plant-disease-detector☆11Apr 21, 2021Updated 4 years ago
- ☆28Dec 15, 2025Updated 4 months ago
- Source code for "Developing AI Agents with LangChain & LangGraph"☆12Mar 3, 2025Updated last year
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- Korean Sentence Embedding Model Performance Benchmark for RAG☆50Jan 27, 2025Updated last year
- Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)☆48Mar 2, 2024Updated 2 years ago
- Example code for the book 《AI Programming with GPT-4, ChatGPT, LlamaIndex, and LangChain》☆10Jan 16, 2024Updated 2 years ago
- ☆36Oct 4, 2023Updated 2 years ago
- Korean Benchmark for Korean Legal Language Understanding☆18Nov 16, 2024Updated last year
- This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervised SimCSE, and dist…☆22Oct 6, 2023Updated 2 years ago
- Pretraining and finetuning for visual instruction following with Mixture of Experts☆15Jan 30, 2024Updated 2 years ago
- An unofficial implementation of the SOLAR-10.7B model, plus the newly proposed interlocked-DUS (iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- A GitHub repo that makes the hwplib package easy to use from Python.☆54Mar 29, 2025Updated last year
- Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc☆81Feb 28, 2024Updated 2 years ago
- 1-Click is all you need.☆63Apr 29, 2024Updated last year
- An open-source solution for training an LLM and building your own GPT without any AI background knowledge | 🏆 Encouragement Prize, 2023 Open Source Software Developer Contest☆11Nov 9, 2023Updated 2 years ago
- A curated and categorized paper list of gnn-based complex graph learning.☆11Apr 9, 2023Updated 3 years ago
- This repository contains the code for the paper: Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language Models☆20Apr 27, 2024Updated last year
- The source code of the game I made for the HuggingFace game jam☆16Jul 25, 2023Updated 2 years ago
- ☆23Feb 11, 2026Updated 2 months ago
- ☆20Jul 24, 2024Updated last year
- ☆13Mar 23, 2023Updated 3 years ago