hurutoriya / doraemon-himitsu-dogu-searchLinks

Doraemon Himitsu Dogu Japanese hybrid search based on Elascticsearch ANN x multi match

☆9

Alternatives and similar repositories for doraemon-himitsu-dogu-search

Users that are interested in doraemon-himitsu-dogu-search are comparing it to the libraries listed below

Sorting:

ubie-oss / esqa
Testing tool to verify the search qualities of the Elasticsearch indices
☆29Updated 2 years ago
WorksApplications / chikkarpy
Japanese synonym library
☆53Updated 3 years ago
megagonlabs / jrte-corpus
Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)
☆76Updated 2 years ago
chakki-works / Japanese-Company-Lexicon
☆98Updated 2 years ago
ir100 / ir100
情報検索100本ノック
☆91Updated 2 years ago
WorksApplications / SudachiTra
Japanese tokenizer for Transformers
☆79Updated last year
kajyuuen / daaja
This repository has implementations of data augmentation for NLP for Japanese.
☆64Updated 2 years ago
kajyuuen / funer
Funer is Rule based Named Entity Recognition tool.
☆22Updated 3 years ago
yagays / ja-timex
自然言語で書かれた時間情報表現を抽出/規格化するルールベースの解析器
☆140Updated 5 months ago
sbintuitions / JMTEB
The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)
☆70Updated 2 weeks ago
rejasupotaro / amazon-product-search
☆22Updated 2 weeks ago
megagonlabs / ginza-transformers
Use custom tokenizers in spacy-transformers
☆16Updated 3 years ago
oreilly-japan / building-search-app-w-ml
『機械学習による検索ランキング改善ガイド』のサンプルコードのリポジトリ
☆21Updated 2 years ago
po3rin / kuro2sudachi
kuro2sudachi lets you to convert kuromoji user dict to sudachi user dict.
☆11Updated 3 months ago
yagays / swem
Python implementation of SWEM (Simple Word-Embedding-based Methods)
☆30Updated 3 years ago
KnowledgeGraphJapan / KGRC-RDF
RDF data for Knowledge Graph Reasoning Challenge.
☆19Updated 5 months ago
daac-tools / find-simdoc
Finding all pairs of similar documents time- and memory-efficiently
☆61Updated 4 months ago
yagays / nayose-wikipedia-ja
Wikipediaから作成した日本語名寄せデータセット
☆35Updated 5 years ago
ujiuji1259 / uke_japanese
☆13Updated 3 years ago
wwwcojp / ja_sentence_segmenter
japanese sentence segmentation library for python
☆71Updated 2 years ago
nobu-g / cohesion-analysis
Code for COLING 2020 Paper
☆13Updated last week
GENZITSU / UsefulMaterials
☆34Updated 5 years ago
himkt / awesome-bert-japanese
📝 A list of pre-trained BERT models for Japanese with word/subword tokenization + vocabulary construction algorithm information
☆131Updated 2 years ago
polm / ipadic-py
IPAdic packaged for easy use from Python.
☆24Updated 3 years ago
m3dev / kannon
Kannon is a wrapper for the gokart library that allows gokart tasks to be easily executed in a distributed and parallel manner on multipl…
☆25Updated 6 months ago
nobu-g / JGLUE-evaluation-scripts
Training and evaluation scripts for JGLUE, a Japanese language understanding benchmark
☆17Updated last week
megagonlabs / UD_Japanese-GSD
Japanese data from the Google UDT 2.0.
☆28Updated 2 years ago
megagonlabs / bunkai
Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)
☆193Updated last year
ikuyamada / wikipedia-nlp
Sample code for natural language processing using Wikipedia
☆19Updated 6 years ago
stockmarkteam / ner-wikipedia-dataset
Wikipediaを用いた日本語の固有表現抽出データセット
☆141Updated last year