자체 구축한 한국어 평가 데이터셋을 이용한 한국어 모델 평가
☆31May 31, 2024Updated last year
Alternatives and similar repositories for kollm_evaluation
Users that are interested in kollm_evaluation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- "자연어처리 알고리즘을 활용한 느린학습자 교육 컨텐츠 제작" 프로젝트 "애움길" 팀입니다. 데이터 수집(크롤링)/EDA/Preprocessing, 쉬운말 생성요약 AI 모델링(NLP - KoBERT, KoBART), 프로토타입 제작을 진행했습니다…☆13Mar 24, 2022Updated 4 years ago
- Hate speech detection corpus in Korean, shared with EMNLP 2023 paper☆17Apr 19, 2024Updated 2 years ago
- BERT score for text generation☆12Jan 15, 2025Updated last year
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- huggingface에 있는 한국어 데이터 세트☆36Oct 10, 2024Updated last year
- 언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.☆461Apr 13, 2025Updated last year
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆12Mar 25, 2025Updated last year
- Efficient fine-tuning for ko-llm models☆183Mar 18, 2024Updated 2 years ago
- ☆13Apr 17, 2024Updated 2 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆28Dec 9, 2022Updated 3 years ago
- 구글에서 발표한 Chain-of-Thought Reasoning without Prompting을 코드로 구현한 레포입니다.☆65Sep 28, 2024Updated last year
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder.