A Japanese tokenizer for Tantivy, based on TinySegmenter.
☆14Mar 27, 2021Updated 5 years ago
Alternatives and similar repositories for tantivy-tokenizer-tiny-segmenter
Users that are interested in tantivy-tokenizer-tiny-segmenter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Compact Japanese tokenizer☆16Aug 17, 2018Updated 7 years ago
- go-active-learning is a command line annotation tool for binary classification problem written in Go.☆15Apr 3, 2021Updated 4 years ago
- At-a-glance overview diagrams of Apache Lucene's default PostingsFormat (inverted index binary format).☆82Mar 25, 2023Updated 3 years ago
- A Japanese Morphological Analyzer written in pure Rust☆26Oct 25, 2019Updated 6 years ago
- Lindera tokenizer for Tantivy.☆68Jan 11, 2026Updated 2 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A git tool to restore the commit logs☆12Mar 1, 2026Updated 3 weeks ago
- ☆11May 12, 2024Updated last year
- Helper macros to write faster, portable and robust init script☆43Apr 14, 2022Updated 3 years ago
- Userland NAT64 implementation on Linux in Ruby☆11Sep 25, 2025Updated 6 months ago
- なさそう☆11Apr 30, 2024Updated last year
- Streaming loader for Amazon Redshift Spectrum☆10May 21, 2025Updated 10 months ago
- Wikipediaから作成した日本語名寄せデータセット☆35Mar 10, 2020Updated 6 years ago
- a (work-in-progress) grammatical WISYWIG text editor☆12Sep 13, 2018Updated 7 years ago
- ☆11Aug 26, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 🦀 A Rust implementation of a RoBERTa classification model for the SNLI dataset☆13Sep 13, 2021Updated 4 years ago
- A partial port of elasticlunr to Rust. Intended to be used for generating compatible search indices.☆56Mar 21, 2025Updated last year
- ☆10Sep 14, 2022Updated 3 years ago
- Wavelet Matrix implementation written in Rust☆16Sep 13, 2022Updated 3 years ago
- Moyuk turns TypeScript functions into web apps in a few seconds.☆14Jun 1, 2025Updated 9 months ago
- ☆10Jun 19, 2023Updated 2 years ago
- ☆13Oct 23, 2017Updated 8 years ago
- Composable, strict CLI framework with static analysis for Rust☆18Jun 19, 2022Updated 3 years ago
- A safe Rust wrapper for libsixel☆40Aug 30, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Profiling C code with Linux perf made easy☆22Apr 20, 2020Updated 5 years ago
- A terrible little hack to integrate BibTeX into markdown in a way that is independent of the markdown parser.☆14Apr 1, 2020Updated 5 years ago
- A tool for visualizing the internal structures of morphological analyzer Sudachi☆18Jun 9, 2022Updated 3 years ago
- Yet another ActivityPub server implementation written in OCaml☆49Dec 25, 2025Updated 3 months ago
- The world's fastest online dictionary☆15Jul 28, 2018Updated 7 years ago
- Rocker is a minimal docker implementation for educational purposes.☆19Apr 18, 2021Updated 4 years ago
- ☆11May 1, 2021Updated 4 years ago
- ☆29Updated this week
- WebRTC network connector for Yjs/Yrs update gossips☆13Jan 20, 2024Updated 2 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- A monolingual parallel corpus for sentence simplification☆11Jul 4, 2016Updated 9 years ago
- Slack API Client for Rust☆18Apr 28, 2023Updated 2 years ago
- SKK input method plugin for fcitx5 that uses LibCSKK☆53Jan 12, 2026Updated 2 months ago
- DEPRECATED: (previously a thin wrapper around conda for xonsh)☆10Aug 6, 2019Updated 6 years ago
- Lists, Texts, ByteStrings and Vectors with type-encoded length☆10Jul 11, 2021Updated 4 years ago
- ☆73Aug 3, 2025Updated 7 months ago
- Now it is exported as an official example☆13Jan 24, 2018Updated 8 years ago