wandb / llm-kr-evalView external linksLinks
☆20Jul 24, 2024Updated last year
Alternatives and similar repositories for llm-kr-eval
Users that are interested in llm-kr-eval are comparing it to the libraries listed below
Sorting:
- Official repository for KoMT-Bench built by LG AI Research☆71Aug 8, 2024Updated last year
- Project of llm evaluation to Japanese tasks☆91Feb 4, 2026Updated last week
- 한국어 언어모델 다분야 사고력 벤치마크☆201Oct 17, 2024Updated last year
- huggingface에 있는 한국어 데이터 세트☆35Oct 10, 2024Updated last year
- [KO-Platy🥮] Korean-Open-platypus를 활용하여 llama-2-ko를 fine-tuning한 KO-platypus model☆73Aug 24, 2025Updated 5 months ago
- Kor-IR: Korean Information Retrieval Benchmark☆87Jul 3, 2024Updated last year
- AutoRAG example about benchmarking Korean embeddings.☆43Oct 2, 2024Updated last year
- BERT score for text generation☆12Jan 15, 2025Updated last year
- An experimental web framework for creating user interfaces☆12Jan 30, 2024Updated 2 years ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- This repository contains the training and evaluation code for llm-jp-modernbert-base.☆14Jun 17, 2025Updated 7 months ago
- The Universe of Evaluation. All about the evaluation for LLMs.☆232Jul 9, 2024Updated last year
- Korean Multi-task Instruction Tuning☆156Dec 20, 2023Updated 2 years ago
- This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured …☆64Apr 29, 2025Updated 9 months ago
- ☆64Jul 21, 2025Updated 6 months ago
- KURE: 고려대학교에서 개발한, 한국어 검색에 특화된 임베딩 모델☆206Sep 10, 2025Updated 5 months ago
- 🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.☆17Jun 5, 2025Updated 8 months ago
- Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m…☆14May 4, 2024Updated last year
- ☆36Oct 4, 2023Updated 2 years ago
- Coord: A Unified Interface for All Models☆18Feb 2, 2026Updated last week
- The most modern LLM evaluation toolkit☆70Nov 9, 2025Updated 3 months ago
- 카카오톡 GPT☆19Apr 9, 2024Updated last year
- ☆15Aug 26, 2023Updated 2 years ago
- Developing a Korean LLM model : Hate Speech Filtering, Improving conversational skills, Finetuning with the RLHF method☆19May 27, 2025Updated 8 months ago
- ☆19Jan 29, 2023Updated 3 years ago
- ☁️ 구름(KULLM): 고려대학교에서 개발한, 한국어에 특화된 LLM☆591May 1, 2024Updated last year
- Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc☆82Feb 28, 2024Updated last year
- Gugugo: 한국어 오픈소스 번역 모델 프로젝트☆85Apr 7, 2024Updated last year
- CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean☆47Dec 23, 2024Updated last year
- Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)☆48Mar 2, 2024Updated last year
- 언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.☆19Jul 16, 2023Updated 2 years ago
- For the rlhf learning environment of Koreans☆25Sep 25, 2023Updated 2 years ago
- Claude-router is a best project for using open model in claude-code☆55Sep 4, 2025Updated 5 months ago
- It shows how to use model-context-protocol.☆39Feb 7, 2026Updated last week
- ☆61Sep 18, 2025Updated 4 months ago
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Jan 30, 2025Updated last year
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder.☆29Dec 2, 2025Updated 2 months ago
- 1-Click is all you need.☆63Apr 29, 2024Updated last year
- MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation☆28Apr 18, 2024Updated last year