Ko-Arena-Hard-Auto: An automatic LLM benchmark for Korean
☆22Apr 23, 2025Updated 10 months ago
Alternatives and similar repositories for ko-arena-hard-auto
Users that are interested in ko-arena-hard-auto are comparing it to the libraries listed below
Sorting:
- A simple JSON parser specifically designed to handle malformed JSON output from Large Language Models (LLMs) like GPT, Claude, and others…☆26Jun 20, 2025Updated 8 months ago
- 한글 텍스트 임베딩 모델 리더보드☆93Oct 22, 2024Updated last year
- BERT score for text generation☆12Jan 15, 2025Updated last year
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 7 months ago
- This repository aims to develop CoT Steering based on CoT without Prompting. It focuses on enhancing the model’s latent reasoning capabil…☆115Jun 25, 2025Updated 8 months ago
- Gunmo-emo-classification: 한국어 감정 다중 분류 모델 제작법☆27Dec 12, 2023Updated 2 years ago
- ☆36Oct 4, 2023Updated 2 years ago
- CPython 파헤치기 스터디☆16Jul 13, 2024Updated last year
- Korean Translation Benchmark, LLM-as-a-judge☆22Oct 23, 2025Updated 4 months ago
- nanoRLHF: from-scratch journey into how LLMs and RLHF really work.☆167Jan 23, 2026Updated last month
- VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai☆36Updated this week
- 한국어 언어모델 다분야 사고력 벤치마크☆201Oct 17, 2024Updated last year
- NovelAi Image Studio☆16Feb 20, 2026Updated 2 weeks ago
- StrategyQA 데이터 세 트 번역☆23Apr 12, 2024Updated last year
- Korean Sentence Embedding Model Performance Benchmark for RAG☆50Jan 27, 2025Updated last year
- Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)☆48Mar 2, 2024Updated 2 years ago
- ☆29Nov 10, 2024Updated last year
- This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervisied SimCSE, and dist…☆22Oct 6, 2023Updated 2 years ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- ☆64Jul 21, 2025Updated 7 months ago
- 1-Click is all you need.☆63Apr 29, 2024Updated last year
- Sample files, code snippets and downloads for ContextualAI☆30Jan 21, 2026Updated last month
- Designing a Dashboard for Transparency and Control of Conversational AI, https://arxiv.org/abs/2406.07882☆33Oct 7, 2025Updated 5 months ago
- KURE: 고려대학교에서 개발한, 한국어 검색에 특화된 임베딩 모델☆205Feb 26, 2026Updated last week
- The most modern LLM evaluation toolkit☆70Nov 9, 2025Updated 4 months ago
- ☆12Sep 9, 2022Updated 3 years ago
- Get text from documents format☆29Nov 22, 2017Updated 8 years ago
- 자체 구축한 한국어 평가 데이터셋을 이용한 한국어 모델 평가☆31May 31, 2024Updated last year
- Crispy reranking models by Mixedbread☆47Sep 17, 2025Updated 5 months ago
- huggingface에 있는 한국어 데이터 세트☆36Oct 10, 2024Updated last year
- Official codes for NAACL 2025 paper "LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias …☆11Nov 25, 2025Updated 3 months ago
- 한국어 의료 분야 특화 챗봇 프로젝트☆33Nov 20, 2023Updated 2 years ago
- fine-tuning tutorial☆18Feb 20, 2026Updated 2 weeks ago
- attempt to perma root the NEC Terrain android phone☆10Jul 24, 2015Updated 10 years ago
- User-friendly viewer for Parquet files☆10Updated this week
- DOS Program Development☆13Nov 9, 2022Updated 3 years ago
- ☆12Oct 23, 2020Updated 5 years ago
- 인공지능에 대한 배경지식이 없어도 LLM을 학습시켜 나만의 GPT를 만들 수 있는 오픈소스 솔루션 | 🏆 2023 공개SW 개발자대회 장려상☆10Nov 9, 2023Updated 2 years ago
- Adversarial Test Dataset for Korean Multi-turn Response Selection☆34Dec 16, 2021Updated 4 years ago