The most modern LLM evaluation toolkit
☆70Nov 9, 2025Updated 4 months ago
Alternatives and similar repositories for haerae-evaluation-toolkit
Users that are interested in haerae-evaluation-toolkit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- Official repository for KoMT-Bench built by LG AI Research☆71Aug 8, 2024Updated last year
- Performs benchmarking on two Korean datasets with minimal time and effort.☆46Jan 22, 2026Updated 2 months ago
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆12Mar 25, 2025Updated last year
- 백준 문제 추천 서비스를 위한 디스코드 봇☆17Feb 19, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 한국어 벤치마크 평가 코드 통합본(?)☆20Nov 15, 2024Updated last year
- ☆116Feb 25, 2026Updated last month
- CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean☆48Dec 23, 2024Updated last year
- Difference-based Contrastive Learning for Korean Sentence Embeddings☆23Mar 11, 2026Updated 2 weeks ago
- 한국어 LLM 리더보드 및 모델 성능/안전성 관리☆22Sep 26, 2023Updated 2 years ago
- ☆19Oct 24, 2023Updated 2 years ago
- 언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.☆455Apr 13, 2025Updated 11 months ago
- Benchmark in Korean Context☆138Sep 26, 2023Updated 2 years ago
- huggingface에 있는 한국어 데이터 세트☆36Oct 10, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- BERT score for text generation☆12Jan 15, 2025Updated last year
- 한국어 언어모델 다분야 사고력 벤치마크☆201Oct 17, 2024Updated last year
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- Ko-Arena-Hard-Auto: An automatic LLM benchmark for Korean☆22Apr 23, 2025Updated 11 months ago
- ChatGPT의 RLHF를 학습을 위한 3가지 step별 한국어 데이터셋☆41Nov 21, 2023Updated 2 years ago
- ☆20Jul 24, 2024Updated last year
- Telegram chatbot for ChatGPT that can be used personally☆11Apr 18, 2023Updated 2 years ago
- Kor-IR: Korean Information Retrieval Benchmark☆87Jul 3, 2024Updated last year
- 어린이를 위한 동화 제작 서비스, My AI Fairy-Tale☆11Apr 7, 2023Updated 2 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- KURE: 고려대학교에서 개발한, 한국어 검색에 특화된 임베딩 모델☆209Feb 26, 2026Updated last month
- hwplib 패키지 python에서 쉽게 사용 할수 있게 만든 github repo 입니다.☆54Mar 29, 2025Updated last year
- This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervisied SimCSE, and dist…☆22Oct 6, 2023Updated 2 years ago
- bpe based korean t5 model for text-to-text unified framework☆63Apr 17, 2024Updated last year
- API server for converts hwp files - thanks to hwplib & hwpxlib☆12Jun 9, 2023Updated 2 years ago
- Official datasets and pytorch implementation repository of SQuARe and KoSBi (ACL 2023)☆249Jun 29, 2023Updated 2 years ago
- A simple fixed-throughput latency testing tool☆16Mar 19, 2022Updated 4 years ago
- ☆33Aug 30, 2023Updated 2 years ago
- my useful torch lightning training template☆32Mar 12, 2023Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?☆18Jan 31, 2025Updated last year
- Evaluating Multimodal Generative AI with Korean Educational Standards, NAACL 2025.☆26May 15, 2025Updated 10 months ago
- A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆37Aug 27, 2025Updated 7 months ago
- ☆64Jul 21, 2025Updated 8 months ago
- [KO-Platy🥮] Korean-Open-platypus를 활용하여 llama-2-ko를 fine-tuning한 KO-platypus model☆73Aug 24, 2025Updated 7 months ago
- ☆61Sep 18, 2025Updated 6 months ago
- ☆123Apr 21, 2023Updated 2 years ago