Performs benchmarking on two Korean datasets with minimal time and effort.
☆45Jan 22, 2026Updated 4 months ago
Alternatives and similar repositories for evaluate-llm-on-korean-dataset
Users that are interested in evaluate-llm-on-korean-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This hands-on walks you through fine-tuning an open source LLM on Azure and serving the fine-tuned model on Azure. It is intended for Dat…☆12Jun 23, 2024Updated last year
- AutoRAG example about benchmarking Korean embeddings.☆45Oct 2, 2024Updated last year
- This lab is a 1-day/2-day end-to-end SLM workshop led and developed by AI GBB. Attendees will learn how to quickly and easily perform the…☆46Jan 22, 2026Updated 4 months ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- Adversarial Test Dataset for Korean Multi-turn Response Selection☆34Dec 16, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The most modern LLM evaluation toolkit☆69Apr 30, 2026Updated last month
- This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured …☆63Apr 21, 2026Updated last month
- This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervisied SimCSE, and dist…☆22Oct 6, 2023Updated 2 years ago
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?☆18Jan 31, 2025Updated last year
- ☆105Apr 11, 2025Updated last year
- It shows how to use strands agent.☆28Apr 23, 2026Updated last month
- ☆33Aug 30, 2023Updated 2 years ago
- This lab is a starter for quickly and easily applying SLM/LLM fine-tuning, evaluation, and quantization with torchtune on Azure ML.☆15Apr 21, 2026Updated last month
- Agent Innovator Lab – building AI agents on Azure, covering search optimization, agent design, evaluation, and RAG best practices.☆55Feb 20, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This hands-on walks you through fine-tuning an open source LLM on Azure and serving the fine-tuned model on Azure. It is intended for Dat…☆61Mar 17, 2025Updated last year
- A powerful PowerPoint translation tool that leverages Amazon Bedrock models for high-quality translation. This service can be used both a…☆64May 1, 2026Updated 3 weeks ago
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆40May 20, 2026Updated last week
- Evaluate gpt-4o on CLIcK (Korean NLP Dataset)☆20May 18, 2024Updated 2 years ago
- A collection of Korean NLP hands-on labs on Amazon SageMaker☆19Dec 20, 2023Updated 2 years ago
- KURE: 고려대학교에서 개발한, 한국어 검색에 특화된 임베딩 모델☆219Apr 14, 2026Updated last month
- 한국어 언어모델 다분야 사고력 벤치마크☆209Oct 17, 2024Updated last year
- Korean Nested Named Entity Corpus☆20May 13, 2023Updated 3 years ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 6 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Kor-IR: Korean Information Retrieval Benchmark☆87Jul 3, 2024Updated last year
- Korean Math Word Problems☆59Jan 14, 2022Updated 4 years ago
- ☆64Jul 21, 2025Updated 10 months ago
- Korean Visual Question Answering☆59Feb 18, 2020Updated 6 years ago
- It shows how to use model-context-protocol.☆40May 12, 2026Updated 2 weeks ago
- This repo is for Korean wiki table question answering datasets described in the paper of Korean-Specific Dataset for Table Question Answe…☆91Oct 22, 2024Updated last year
- Official repository for KoMT-Bench built by LG AI Research☆73Aug 8, 2024Updated last year
- [KO-Platy🥮] Korean-Open-platypus를 활용하여 llama-2-ko를 fine-tuning한 KO-platypus model☆73Aug 24, 2025Updated 9 months ago
- A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆37Aug 27, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 카카오뱅크 & 에프 엔가이드에서 학습한 금융 도메인 특화 언어모델☆122Jan 16, 2024Updated 2 years ago
- Official datasets and pytorch implementation repository of SQuARe and KoSBi (ACL 2023)☆250Jun 29, 2023Updated 2 years ago
- Korean Multi-task Instruction Tuning☆156Dec 20, 2023Updated 2 years ago
- 한국어 LLM 리더보드 및 모델 성능/안전성 관리☆22Sep 26, 2023Updated 2 years ago
- Liner LLM Meetup archive☆70Mar 27, 2024Updated 2 years ago
- bb25 is a fast, self-contained BM25 + Bayesian calibration implementation with a minimal Python API.☆147Mar 17, 2026Updated 2 months ago
- 한국어 심리 상담 데이터셋☆80Jun 20, 2023Updated 2 years ago