qwopqwop200 / ko-arena-hard-auto
Ko-Arena-Hard-Auto: An automatic LLM benchmark for Korean
☆22 · Apr 23, 2025 · Updated 9 months ago
Alternatives and similar repositories for ko-arena-hard-auto
Users who are interested in ko-arena-hard-auto are comparing it to the repositories listed below.
- A simple JSON parser specifically designed to handle malformed JSON output from Large Language Models (LLMs) like GPT, Claude, and others… ☆26 · Jun 20, 2025 · Updated 7 months ago
- Korean text embedding model leaderboard ☆93 · Oct 22, 2024 · Updated last year
- BERT score for text generation ☆12 · Jan 15, 2025 · Updated last year
- 🎹 Instruct.KR 2025 Summer Meetup: Open-source LLMs, all the way to production with vLLM 🎹 ☆24 · Aug 2, 2025 · Updated 6 months ago
- This repository aims to develop CoT Steering based on CoT without Prompting. It focuses on enhancing the model’s latent reasoning capabil… ☆115 · Jun 25, 2025 · Updated 7 months ago
- Gunmo-emo-classification: How to build a Korean multi-class emotion classification model ☆27 · Dec 12, 2023 · Updated 2 years ago
- ☆36 · Oct 4, 2023 · Updated 2 years ago
- nanoRLHF: from-scratch journey into how LLMs and RLHF really work. ☆159 · Jan 23, 2026 · Updated 3 weeks ago
- CPython deep-dive study group ☆16 · Jul 13, 2024 · Updated last year
- Korean Translation Benchmark, LLM-as-a-judge ☆23 · Oct 23, 2025 · Updated 3 months ago
- Multi-domain reasoning benchmark for Korean language models ☆201 · Oct 17, 2024 · Updated last year
- NovelAi Image Studio ☆11 · Feb 9, 2026 · Updated last week
- Translation of the StrategyQA dataset ☆23 · Apr 12, 2024 · Updated last year
- Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series) ☆48 · Mar 2, 2024 · Updated last year
- Korean Sentence Embedding Model Performance Benchmark for RAG ☆50 · Jan 27, 2025 · Updated last year
- This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervised SimCSE, and dist… ☆22 · Oct 6, 2023 · Updated 2 years ago
- ☆64 · Jul 21, 2025 · Updated 6 months ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models ☆25 · Aug 24, 2024 · Updated last year
- 1-Click is all you need. ☆63 · Apr 29, 2024 · Updated last year
- KURE: an embedding model specialized for Korean retrieval, developed at Korea University ☆206 · Sep 10, 2025 · Updated 5 months ago
- The most modern LLM evaluation toolkit ☆70 · Nov 9, 2025 · Updated 3 months ago
- A finance-domain language model trained by KakaoBank & FnGuide ☆121 · Jan 16, 2024 · Updated 2 years ago
- Get text from documents format ☆29 · Nov 22, 2017 · Updated 8 years ago
- Korean model evaluation using an in-house Korean evaluation dataset ☆31 · May 31, 2024 · Updated last year
- Crispy reranking models by Mixedbread ☆46 · Sep 17, 2025 · Updated 5 months ago
- Korean datasets on Hugging Face ☆36 · Oct 10, 2024 · Updated last year
- Production-ready RAG backend. Start in 5 min, swap Vector DB/LLM/Reranker with 1 line config. 6 DBs, 4 LLMs, GraphRAG included. ☆81 · Feb 1, 2026 · Updated 2 weeks ago
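- An open-source solution that lets you train an LLM and build your own GPT without any background knowledge of AI | 🏆 Encouragement Award, 2023 Open Source Software Developer Contest ☆10 · Nov 9, 2023 · Updated 2 years ago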
- DOS Program Development ☆12 · Nov 9, 2022 · Updated 3 years ago
- Official codes for NAACL 2025 paper "LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias … ☆11 · Nov 25, 2025 · Updated 2 months ago
- ☆12 · Oct 23, 2020 · Updated 5 years ago
- Adversarial Test Dataset for Korean Multi-turn Response Selection ☆34 · Dec 16, 2021 · Updated 4 years ago
- User-friendly viewer for Parquet files ☆10 · Jan 10, 2026 · Updated last month
- fine-tuning tutorial ☆17 · Dec 13, 2025 · Updated 2 months ago
- A chatbot project specialized for the Korean medical domain ☆32 · Nov 20, 2023 · Updated 2 years ago
- attempt to perma root the NEC Terrain android phone ☆10 · Jul 24, 2015 · Updated 10 years ago
- A Korean spacing correction model that aims for fast speed and moderate accuracy. ☆36 · Nov 25, 2022 · Updated 3 years ago
- ✨ NovelAI api python sdk, easy to use, modern and user-friendly. ☆43 · Dec 2, 2025 · Updated 2 months ago
- Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc ☆82 · Feb 28, 2024 · Updated last year