mocobeta / lucene-postings-formatView external linksLinks
At-a-glance overview diagrams of Apache Lucene's default PostingsFormat (inverted index binary format).
☆82Mar 25, 2023Updated 2 years ago
Alternatives and similar repositories for lucene-postings-format
Users that are interested in lucene-postings-format are comparing it to the libraries listed below
Sorting:
- Yada is a yet another double-array trie library aiming for fast search and compact data representation.☆44Feb 25, 2024Updated last year
- Utility scripts for running Lucene performance tests, and 15 years of nightly benchmarks☆222Updated this week
- ☆12Mar 6, 2020Updated 5 years ago
- kuro2sudachi lets you to convert kuromoji user dict to sudachi user dict.☆11Apr 26, 2025Updated 9 months ago
- Japanese tokenizer for rust☆38Nov 5, 2019Updated 6 years ago
- WIP: Full text search engine library written in Go with 1.18+ Generics, heavily inspired by Tantivy☆14Apr 5, 2023Updated 2 years ago
- ☆12May 12, 2021Updated 4 years ago
- 情報検索100本ノック☆93Dec 3, 2025Updated 2 months ago
- A Japanese Morphological Analyzer written in pure Rust☆26Oct 25, 2019Updated 6 years ago
- Testing tool to verify the search qualities of the Elasticsearch indices☆29Jan 8, 2023Updated 3 years ago
- go-active-learning is a command line annotation tool for binary classification problem written in Go.☆15Apr 3, 2021Updated 4 years ago
- 「仕事ではじめる検索システム」という本があったなら,という想像の産物です -> 「 検索システム ― 実務者のための開発改善ガイドブック」になりました☆142May 16, 2022Updated 3 years ago
- Advanced desktop search/corpus exploration prototype☆21Jun 23, 2021Updated 4 years ago
- High performance fulltext search engine☆12Oct 7, 2024Updated last year
- A DropWizard wrapper around Apache Tika.☆10Dec 22, 2016Updated 9 years ago
- ☆12Jan 12, 2026Updated last month
- ☆10Oct 2, 2021Updated 4 years ago
- 🧭 From Confluence to clean Markdown, images and all — just one command☆20Feb 2, 2026Updated 2 weeks ago
- SIMD algorithms for integer compression via bitpacking. This crate is a port of a C library called simdcomp.☆327Feb 9, 2026Updated last week
- 🦞 Rust library of natural language dictionaries using character-wise double-array tries.☆36Jan 13, 2025Updated last year
- ☆13Oct 5, 2020Updated 5 years ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems☆33Nov 21, 2025Updated 2 months ago
- PISA: Performant Indexes and Search for Academia☆1,046Feb 5, 2026Updated last week
- ☆15Apr 14, 2023Updated 2 years ago
- Yosina is a transliteration library deals with the letters and symbols used in Japanese writing.☆20Sep 24, 2025Updated 4 months ago
- The Solr Package Directory and Sanctuary☆13Oct 14, 2025Updated 4 months ago
- bqiam is an admin tool for managing BigQuery permissions☆12Dec 25, 2025Updated last month
- 🚍 Find public transport departures in the VVO/DVB network from Alfred☆15Nov 19, 2017Updated 8 years ago
- Search engine benchmark (Tantivy, Lucene, PISA, ...)☆103Jan 30, 2026Updated 2 weeks ago
- A pushdown automaton low memory JSON bytes stream checker☆13Dec 24, 2021Updated 4 years ago
- A Rust crate for a Bucket Queue data structure that can be used as a Priority Queue.☆19Nov 20, 2018Updated 7 years ago
- SWIM Protocol in Java☆10Apr 1, 2020Updated 5 years ago
- engula-operator creates/configures/manages engula clusters atop Kubernetes☆12Jan 5, 2022Updated 4 years ago
- ☆30Sep 16, 2025Updated 5 months ago
- Japanese synonym library☆55Feb 7, 2022Updated 4 years ago
- A high performance gRPC server on top of Apache Lucene☆304Feb 5, 2026Updated last week
- Finding all pairs of similar documents time- and memory-efficiently☆62Mar 13, 2025Updated 11 months ago
- eskeeper synchronizes index and alias with configuration files while ensuring idempotency.☆37Aug 23, 2022Updated 3 years ago
- A tool for visualizing the internal structures of morphological analyzer Sudachi☆18Jun 9, 2022Updated 3 years ago