OnAnd0n / ko-embedding-leaderboardView external linksLinks
Korean-MTEB
☆74Jan 25, 2026Updated 3 weeks ago
Alternatives and similar repositories for ko-embedding-leaderboard
Users that are interested in ko-embedding-leaderboard are comparing it to the libraries listed below
Sorting:
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder.☆29Dec 2, 2025Updated 2 months ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 2 months ago
- ☆14Jul 7, 2024Updated last year
- ☆28Jul 11, 2025Updated 7 months ago
- KURE: 고려대학교에서 개발한, 한국어 검색에 특화된 임베딩 모델☆206Sep 10, 2025Updated 5 months ago
- Kor-IR: Korean Information Retrieval Benchmark☆87Jul 3, 2024Updated last year
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆24Aug 2, 2025Updated 6 months ago
- AutoRAG example about benchmarking Korean embeddings.☆43Oct 2, 2024Updated last year
- Performs benchmarking on two Korean datasets with minimal time and effort.☆45Jan 22, 2026Updated 3 weeks ago
- ☆11Mar 12, 2025Updated 11 months ago
- Generate fixed dimensional embeddings for multi-dimensional vectors in python based on Muvera from Google.☆20Jun 28, 2025Updated 7 months ago
- bb25 is a fast, self-contained BM25 + Bayesian calibration implementation with a minimal Python API.☆35Feb 8, 2026Updated last week
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆15Sep 4, 2025Updated 5 months ago
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆38Feb 3, 2026Updated last week
- Code for the paper "Multi-Field Adaptive Retrieval," a research project on a semi-structured document retrieval☆15Updated this week
- An extensible framework to automate your entire newsletter workflow. Handles data collection, LLM-based content analysis, and email gener…☆38Updated this week
- The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning☆27Jul 27, 2025Updated 6 months ago
- This repository aims to develop CoT Steering based on CoT without Prompting. It focuses on enhancing the model’s latent reasoning capabil…☆115Jun 25, 2025Updated 7 months ago
- MEXMA: Token-level objectives improve sentence representations☆42Jan 6, 2025Updated last year
- ☆34Feb 27, 2024Updated last year
- ☆39Mar 11, 2025Updated 11 months ago
- ☆19May 16, 2024Updated last year
- ☆20Updated this week
- nanoRLHF: from-scratch journey into how LLMs and RLHF really work.☆155Jan 23, 2026Updated 3 weeks ago
- CPython 파헤치기 스터디☆16Jul 13, 2024Updated last year
- It shows a bedrock agent.☆21Jun 3, 2025Updated 8 months ago
- Welcome to the Storm Cookbook! This is your go to guide for Building with STORM Solution.☆39Aug 14, 2025Updated 6 months ago
- It shows how to deploy and use an agent with LLM.☆19Mar 1, 2025Updated 11 months ago
- ☆57Jan 26, 2025Updated last year
- Efficient fine-tuning for ko-llm models☆185Mar 18, 2024Updated last year
- ☆65Feb 6, 2026Updated last week
- 한글 텍스트 임베딩 모델 리더보드☆93Oct 22, 2024Updated last year
- Korean Sentence Embedding Model Performance Benchmark for RAG☆50Jan 27, 2025Updated last year
- NLP 역사부터 서빙까지 한 권의 책에서 다룹니다.☆24Dec 6, 2025Updated 2 months ago
- ☆26Feb 11, 2025Updated last year
- Ko-Arena-Hard-Auto: An automatic LLM benchmark for Korean☆22Apr 23, 2025Updated 9 months ago
- Korean Sentence Embedding Repository☆210Dec 1, 2024Updated last year
- ☆25Jul 24, 2024Updated last year
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year