Japanese tokenizer for rust
☆39Nov 5, 2019Updated 6 years ago
Alternatives and similar repositories for kuromoji-rs
Users that are interested in kuromoji-rs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CC-CEDICT-MeCab is a MeCab dictionary for Chinese (Mandarin) text segmentation☆13Apr 9, 2020Updated 6 years ago
- Yada is a yet another double-array trie library aiming for fast search and compact data representation.☆48Feb 25, 2024Updated 2 years ago
- A Japanese tokenizer for Tantivy, based on TinySegmenter.☆14Mar 27, 2021Updated 5 years ago
- Japanese text preprocessor for Text-to-Speech applications (OpenJTalk rewrite in rust language)☆54Updated this week
- A multilingual morphological analysis library.☆618Updated this week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- At-a-glance overview diagrams of Apache Lucene's default PostingsFormat (inverted index binary format).☆82Mar 25, 2023Updated 3 years ago
- This is an example project for grpc-go☆12Jan 5, 2016Updated 10 years ago
- Supporting example for "A Rust SentencePiece implementation"☆20Jun 7, 2020Updated 5 years ago
- ☆32Feb 16, 2026Updated 2 months ago
- sqlite3 fts5 mecab☆23Aug 9, 2019Updated 6 years ago
- Compact Japanese tokenizer☆16Aug 17, 2018Updated 7 years ago
- ☆10Oct 2, 2021Updated 4 years ago
- A blend of the compact and sparse hash table implementations.☆15Aug 20, 2021Updated 4 years ago
- Sample code for natural language processing using Wikipedia☆19Oct 23, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Hybrid Search (BM25 & Vector) with SQLite☆32Aug 13, 2024Updated last year
- Rust library providing fast language model queries in compressed space☆25Oct 1, 2022Updated 3 years ago
- ☆14Aug 20, 2020Updated 5 years ago
- A multi-language segmenter using high-order CRF.☆17Feb 27, 2020Updated 6 years ago
- Telescope extension wrapper around `:scriptnames`☆10Mar 7, 2023Updated 3 years ago
- An in-memory SQL database in Rust.☆14Aug 15, 2021Updated 4 years ago
- ☆12Apr 11, 2026Updated 2 weeks ago
- Fun Self-Management for macOS☆13Updated this week
- Bleve Extensions☆45Mar 23, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- a (work-in-progress) grammatical WISYWIG text editor☆12Sep 13, 2018Updated 7 years ago
- ☆14Feb 26, 2021Updated 5 years ago
- Board support crate for the STM32F103C8T6 bluepill☆11Jun 9, 2017Updated 8 years ago
- animated png encoder 🦀☆36Feb 14, 2024Updated 2 years ago
- 🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer☆270Feb 7, 2026Updated 2 months ago
- A Japanese law parser☆25Jan 25, 2024Updated 2 years ago
- NativeScript bindings for Solid.JS☆15Jan 2, 2022Updated 4 years ago
- A simple iOS application that demonstrates how the end-to-end encryption works.☆13Feb 7, 2020Updated 6 years ago
- Mach-O Fat Binary Reader and Writer☆25Feb 3, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A compile time sized array of bits☆12Aug 15, 2021Updated 4 years ago
- Viterbi-based accelerated tokenizer (Python wrapper)☆43Sep 4, 2024Updated last year
- Moyuk turns TypeScript functions into web apps in a few seconds.☆14Jun 1, 2025Updated 10 months ago
- MacBook Pro keyboard written in SwiftUI.☆12Jan 19, 2021Updated 5 years ago
- Simple implementation of a GPT (training and inference) in PyTorch.☆13Dec 11, 2023Updated 2 years ago
- ☆10Jun 19, 2023Updated 2 years ago
- ☆22Apr 9, 2019Updated 7 years ago