nouu-me/document_vector_search_benchmark


nouu-me / document_vector_search_benchmark

Benchmark for Japanese document embedding & vector search

☆29

Alternatives and similar repositories for document_vector_search_benchmark

Users who are interested in document_vector_search_benchmark are comparing it to the repositories listed below.


hotchpotch / JQaRA
JQaRA: Japanese Question Answering with Retrieval Augmentation - a Japanese Q&A dataset for evaluating retrieval augmentation (RAG)
☆44 · Sep 9, 2025 · Updated 10 months ago
hppRC / bert-classification-tutorial-2024
[2024 edition] Text classification with BERT
☆30 · Jul 8, 2024 · Updated 2 years ago
AnswerDotAI / msglm
msglm makes it a little easier to create messages for language models like Claude and OpenAI GPTs.
☆15 · Apr 6, 2026 · Updated 3 months ago
nlp-waseda / JMMLU
Japanese Massive Multitask Language Understanding Benchmark
☆40 · Oct 7, 2025 · Updated 9 months ago
sbintuitions / JMTEB
The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)
☆93 · Mar 16, 2026 · Updated 4 months ago
DaisukeBekki / JSeM
Japanese semantic test suite (FraCaS counterpart and extensions)
☆13 · Apr 21, 2026 · Updated 3 months ago
ku-nlp / ja-vicuna-qa-benchmark
☆33 · Jul 31, 2024 · Updated last year
Aratako / Task-Vector-Merge-Optimzier
☆16 · Apr 11, 2024 · Updated 2 years ago
osekilab / JCoLA
☆19 · Apr 21, 2026 · Updated 3 months ago
webbigdata-jp / JTransBench
A tool to easily benchmark Japanese translation skills
☆13 · Oct 11, 2025 · Updated 9 months ago
shyaginuma / cibook-study-python
Python implementation of the code from the book 効果検証入門 (Introduction to Effect Verification).
☆19 · May 9, 2020 · Updated 6 years ago
oshizo / JapaneseEmbeddingEval
☆183 · Oct 9, 2024 · Updated last year
hppRC / simple-simcse-ja
Exploring Japanese SimCSE
☆69 · Oct 31, 2023 · Updated 2 years ago
hppRC / llm-translator
Mixtral-based Ja-En (En-Ja) Translation model
☆20 · Jan 6, 2025 · Updated last year
hppRC / bert-classification-tutorial
[2023 edition] Text classification with BERT
☆234 · May 28, 2024 · Updated 2 years ago
pappitti / modernbert-mlx
Implementation of ModernBERT in MLX
☆21 · Jan 7, 2026 · Updated 6 months ago
llm-jp / llm-jp-modernbert
This repository contains the training and evaluation code for llm-jp-modernbert-base.
☆17 · Jun 17, 2025 · Updated last year
izuna385 / Wikia-and-Wikipedia-EL-Dataset-Creator
You can create datasets from Wikia/Wikipedia that can be used for entity recognition and Entity Linking. Dumps for ja-wiki and VTuber-wik…
☆18 · May 2, 2021 · Updated 5 years ago
yuzu-ai / japanese-llm-ranking
☆50 · Apr 10, 2024 · Updated 2 years ago
tkengo / tf
TensorFlow samples
☆15 · Feb 25, 2016 · Updated 10 years ago
retarfi / language-pretraining
Pre-training Language Models for Japanese
☆50 · Jul 2, 2023 · Updated 3 years ago
megagonlabs / instruction_ja
Japanese instruction data
☆24 · Jul 13, 2023 · Updated 3 years ago
kunishou / do-not-answer-ja
☆24 · Dec 15, 2023 · Updated 2 years ago
airreader / datastudio-2-slack
☆17 · Mar 8, 2021 · Updated 5 years ago
tsafavi / cascader
CascadER: Cross-Modal Cascading for Knowledge Graph Link Prediction (arXiv 22)
☆13 · Jun 17, 2022 · Updated 4 years ago
pfnet-research / pfgen-bench
Preferred Generation Benchmark
☆102 · Mar 6, 2026 · Updated 4 months ago
aiwolf / AIWolfPy
☆50 · Jan 9, 2022 · Updated 4 years ago
shisa-ai / shaberi
Lightblue LLM Eval Framework: tengu, elyza100, ja-mtbench, rakuda
☆19 · Apr 29, 2026 · Updated 2 months ago
Aratako / Japanese-RP-Bench
☆19 · Sep 29, 2024 · Updated last year
MilanistaDev / SwipePageChanger
This app is a sample app that links the tab displayed in the Navigation Bar and the paging of the content. The Tab part is scrollable, an…
☆12 · Apr 1, 2023 · Updated 3 years ago
maxdotio / neural-solr
Neural Solr = Solr 9 + Mighty Inference + Node
☆18 · Jun 9, 2022 · Updated 4 years ago
ncodepro / pdfchatbot
Create a QnA bot on a pdf
☆16 · May 27, 2023 · Updated 3 years ago
stuartemiddleton / glosat_table_dataset
GloSAT Historical Measurement Table Dataset
☆11 · Dec 3, 2025 · Updated 7 months ago
informagi / mmead
MS Marco Entity Annotations Disambiguation
☆14 · May 19, 2023 · Updated 3 years ago
hsivonen / recode_rs
Test/sample app for encoding_rs in Rust
☆12 · Jan 21, 2022 · Updated 4 years ago
shisa-ai / shisa-v2
Japanese / English Bilingual LLM
☆34 · Dec 23, 2025 · Updated 7 months ago
COMBINE-lab / piscem-infer
☆15 · May 22, 2026 · Updated 2 months ago
megagonlabs / UD_Japanese-GSD
Japanese data from the Google UDT 2.0.
☆28 · Mar 24, 2023 · Updated 3 years ago
CyberAgentAILab / model-acceleration-tutorial
CyberAgent AI Lab training: "Tutorial on speeding up and optimizing model code"
☆35 · Mar 13, 2025 · Updated last year