Marker-Inc-Korea/KoLLM_Eval

Marker-Inc-Korea / KoLLM_Eval

Integrated collection(?) of Korean benchmark evaluation code

☆21

Alternatives and similar repositories for KoLLM_Eval

Users interested in KoLLM_Eval are comparing it to the repositories listed below.

LG-AI-EXAONE / KoMT-Bench
Official repository for KoMT-Bench built by LG AI Research
☆73 · Aug 8, 2024 · Updated last year
KyujinHan / KO-stable-diffusion-anything
Diffusion-based Korean text-to-image generation model
☆12 · Aug 16, 2023 · Updated 2 years ago
NomaDamas / IdolGAN
Project for restoring images of K-pop idols to high quality.
☆14 · Mar 19, 2023 · Updated 3 years ago
HAE-RAE / haerae-evaluation-toolkit
The most modern LLM evaluation toolkit
☆70 · Apr 30, 2026 · Updated 2 months ago
korean-named-entity / konec
Korean Named Entity Corpus
☆25 · May 12, 2023 · Updated 3 years ago
Beomi / ko-lm-evaluation-harness
Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc
☆81 · Feb 28, 2024 · Updated 2 years ago
aotakeda / ai-json-fixer
A simple JSON parser specifically designed to handle malformed JSON output from Large Language Models (LLMs) like GPT, Claude, and others…
☆27 · Jun 20, 2025 · Updated last year
tencent-ailab / OASum
☆15 · Oct 20, 2023 · Updated 2 years ago
hanbit / blueprints-text
Example code repository for the book 『파이썬 라이브러리를 활용한 텍스트 분석』 ("Text Analysis Using Python Libraries", Hanbit Media, 2022).
☆11 · Sep 22, 2022 · Updated 3 years ago
groomata / vision
Clean, reproducible, boilerplate-free deep learning project template.
☆19 · May 3, 2023 · Updated 3 years ago
Marker-Inc-Korea / KO-VLM-Benchmark
A VLM benchmark dataset built from real-world Korean document datasets
☆29 · Jan 25, 2026 · Updated 5 months ago
leemingo / tigt
This repository is the official implementation of Topology-Informed Graph Transformer (Choi et al., GRaM Workshop at ICML 2024).
☆12 · Dec 28, 2024 · Updated last year
kakao / OrchestrationBench
☆48 · Apr 17, 2026 · Updated 3 months ago
hyunbool / Text-Segmentation
A roundup of papers on text segmentation
☆20 · Jan 21, 2021 · Updated 5 years ago
chanmuzi / NLP-Paper-News
A list of NLP papers and news I've checked. Some entries may include a short description (abstract) in Korean.
☆38 · Updated this week
suhan1433 / LLM-as-a-judge-using-G-eval
LLM-as-a-judge using G-eval Scratch
☆15 · Oct 12, 2025 · Updated 9 months ago
whybe-choi / kovidore-benchmark
[ACL'26 Workshop] KoViDoRe: Korean Visual Document Retrieval Benchmark
☆24 · Jul 2, 2026 · Updated 2 weeks ago
THU-KEG / R-Eval
[KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models
☆11 · Apr 9, 2024 · Updated 2 years ago
trailerAI / KoTAN
KoTAN: Korean Translation and Augmentation with fine-tuned NLLB
☆23 · Jan 4, 2024 · Updated 2 years ago
kyegomez / COT-SC
Plug in and Play Prompt Technique to Boost Model reasoning by 40%
☆12 · May 30, 2023 · Updated 3 years ago
Hugging-Face-KREW / Ko-AgentBench
☆66 · Feb 6, 2026 · Updated 5 months ago
snuhcc / DICE-Bench
[ACL 2025] DICE-BENCH: Evaluating the Tool-Use Capabilities of Large Language Models in Multi-Round, Multi-Party Dialogues
☆26 · Jul 10, 2025 · Updated last year
human-rights-corpus / HRC
#인권코퍼스 (Human Rights Corpus)
☆31 · Oct 6, 2023 · Updated 2 years ago
songys / huggingface_KoreanDataset
Korean datasets available on Hugging Face
☆37 · Oct 10, 2024 · Updated last year
NomaDamas / hwp-converter-api
API server for converting hwp files, thanks to hwplib & hwpxlib
☆13 · Jun 9, 2023 · Updated 3 years ago
Marker-Inc-Korea / KO-Platypus
[KO-Platy🥮] KO-platypus model: llama-2-ko fine-tuned using Korean-Open-platypus
☆73 · Aug 24, 2025 · Updated 10 months ago
K-intelligence-Midm / Midm-2.0
Official repository for Mi:dm 2.0, the large language model developed by KT.
☆60 · Oct 29, 2025 · Updated 8 months ago
HLTCHKUST / Perplexity-FactChecking
Towards Few-Shot Fact-Checking via Perplexity
☆13 · Jun 11, 2021 · Updated 5 years ago
Marker-Inc-Korea / COT_steering
This repository aims to develop CoT Steering based on CoT without Prompting. It focuses on enhancing the model’s latent reasoning capabil…
☆116 · Jun 25, 2025 · Updated last year
kakao / FunctionChat-Bench
☆119 · Feb 25, 2026 · Updated 4 months ago
aqweteddy / ChatVector
Official code for the ACL 2024 paper: Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New …
☆61 · May 22, 2024 · Updated 2 years ago
Astro36 / kokoa
Unsupervised Learning Korean Kernel Object Analyzer
☆13 · Feb 27, 2019 · Updated 7 years ago
teddylee777 / Kor-IR
Kor-IR: Korean Information Retrieval Benchmark
☆17 · Jul 3, 2024 · Updated 2 years ago
DopeorNope-Lee / Ko-Fine-tuning_DataGen
☆67 · Mar 21, 2024 · Updated 2 years ago
HAE-RAE / HAERAE-VISION
Evaluation code for HAERAE-Vision benchmark
☆15 · Apr 29, 2026 · Updated 2 months ago
lsjsj92 / airflow_tutorial
Python Airflow tutorial and examples
☆13 · Mar 23, 2022 · Updated 4 years ago
SeoroMin / Prompt4LLM-Eval
☆19 · Nov 26, 2023 · Updated 2 years ago
FacerAin / facerain.github.io
☆10 · Feb 16, 2025 · Updated last year
NomaDamas / girlfriend-in-cli
An AI girlfriend or boyfriend that lives inside your terminal. Real personas, real conversations, your shell as a chat room.
☆58 · May 7, 2026 · Updated 2 months ago