Kor-IR: Korean Information Retrieval Benchmark
☆87Jul 3, 2024Updated last year
Alternatives and similar repositories for Kor-IR
Users who are interested in Kor-IR are comparing it to the libraries listed below
- ☆36Oct 4, 2023Updated 2 years ago
- KURE: an embedding model specialized for Korean retrieval, developed at Korea University☆205Feb 26, 2026Updated last week
- A multi-domain reasoning benchmark for Korean language models☆201Oct 17, 2024Updated last year
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- ☆19Sep 3, 2024Updated last year
- ☆20Jul 24, 2024Updated last year
- Official repository for KoMT-Bench built by LG AI Research☆71Aug 8, 2024Updated last year
- A collection of public Korean instruction datasets for training language models.☆453Apr 13, 2025Updated 10 months ago
- AutoRAG example about benchmarking Korean embeddings.☆43Oct 2, 2024Updated last year
- Korean datasets available on huggingface☆36Oct 10, 2024Updated last year
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder.☆29Dec 2, 2025Updated 3 months ago
- ☆10Oct 28, 2024Updated last year
- A Korean Wikipedia corpus segmented into sentences. Download it from Releases or use it through tfds-korean.☆24Sep 6, 2023Updated 2 years ago
- baikal.ai's pre-trained BERT models: descriptions and sample codes☆12Jun 24, 2021Updated 4 years ago
- KoLLaVA: Korean Large Language-and-Vision Assistant (feat.LLaVA)☆297Sep 20, 2024Updated last year
- Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc☆82Feb 28, 2024Updated 2 years ago
- Code implementation that prevents LLMs from generating foreign-language tokens☆83Aug 7, 2025Updated 6 months ago
- Performs benchmarking on two Korean datasets with minimal time and effort.☆46Jan 22, 2026Updated last month
- A collection of public Korean instruction datasets for training language models.☆19Jul 16, 2023Updated 2 years ago
- Korean Sentence Embedding Repository☆210Dec 1, 2024Updated last year
- A Korean tutorial based on the official LangChain documentation, Cookbook, and other practical examples. It shows how to use LangChain more easily and effectively.☆1,986Aug 18, 2025Updated 6 months ago
- Utility functions and classes that make LangChain easier to implement, distributed as a package.☆123Aug 16, 2025Updated 6 months ago
- Korean Nested Named Entity Corpus☆20May 13, 2023Updated 2 years ago
- KSS: Korean String processing Suite☆469Nov 13, 2025Updated 3 months ago
- ☆102Apr 11, 2025Updated 10 months ago
- Korean-MTEB☆74Jan 25, 2026Updated last month
- Korean datasets for each of the three steps of training ChatGPT's RLHF☆40Nov 21, 2023Updated 2 years ago
- BERT score for text generation☆12Jan 15, 2025Updated last year
- Korean coreference resolution (over entity candidates)☆10Aug 12, 2020Updated 5 years ago
- 🎹 Instruct.KR 2025 Summer Meetup: Open-source LLMs, all the way to production with vLLM 🎹☆23Aug 2, 2025Updated 7 months ago
- Korean lexical semantic analysis model☆22Apr 4, 2022Updated 3 years ago
- Korean text data preprocess toolkit for NLP☆18Jun 11, 2019Updated 6 years ago
- bpe based korean t5 model for text-to-text unified framework☆63Apr 17, 2024Updated last year
- Official datasets and pytorch implementation repository of SQuARe and KoSBi (ACL 2023)☆249Jun 29, 2023Updated 2 years ago
- ☆19Sep 20, 2022Updated 3 years ago
- NamuwikiExtractor, for obtaining cleaned text from Namuwiki dumps☆19Feb 27, 2022Updated 4 years ago
- 42dot LLM consists of a pre-trained language model, 42dot LLM-PLM, and a fine-tuned model, 42dot LLM-SFT, which is trained to respond to …☆131Mar 7, 2024Updated last year
- Evaluation of Korean models using in-house Korean evaluation datasets☆31May 31, 2024Updated last year
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆28Dec 9, 2022Updated 3 years ago