๐ค vibrato: Viterbi-based accelerated tokenizer
โ406Feb 7, 2026Updated 2 months ago
Alternatives and similar repositories for vibrato
Users that are interested in vibrato are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ๐ฅ Vaporetto: Very accelerated pointwise prediction based tokenizerโ257Feb 7, 2026Updated 2 months ago
- Viterbi-based accelerated tokenizer (Python wrapper)โ43Sep 4, 2024Updated last year
- A multilingual morphological analysis library.โ617Updated this week
- Sudachi in Rust ๐ฆ and new generation of SudachiPyโ438Updated this week
- Rust implementation of SIF and uSIF: Simple and fast sentence embeddingโ19Jan 22, 2025Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer โข AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Japanese Morphological Analyzer written in Rustโ109Feb 25, 2026Updated last month
- A tool for visualizing the internal structures of morphological analyzer Sudachiโ18Jun 9, 2022Updated 3 years ago
- Japanese text preprocessor for Text-to-Speech applications (OpenJTalk rewrite in rust language)โ54Updated this week
- Japanese Morphological Analysis written in Rustโ83Dec 30, 2021Updated 4 years ago
- ๐ A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure in Rust.โ256Updated this week
- Sentence boundary disambiguation tool for Japanese texts (ๆฅๆฌ่ชๆๅข็ๅคๅฎๅจ)โ199Mar 26, 2024Updated 2 years ago
- Yada is a yet another double-array trie library aiming for fast search and compact data representation.โ48Feb 25, 2024Updated 2 years ago
- โ1,603Apr 12, 2026Updated last week
- A lexicon for Sudachiโ289Jan 20, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient โข AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Python linting made easy. Also a casual yet honorific way to address individuals who have entered an organization prior to you.โ494Dec 22, 2025Updated 3 months ago
- ๐ฆ Rust library of natural language dictionaries using character-wise double-array tries.โ37Jan 13, 2025Updated last year
- ๐ฅ Vaporetto is a fast and lightweight pointwise prediction based tokenizer. This is a Python wrapper for Vaporetto.โ21Jun 1, 2025Updated 10 months ago
- Python binding for Jagger(C++ implementation of Pattern-based Japanese Morphological Analyzer)โ13Dec 16, 2025Updated 4 months ago
- Finding all pairs of similar documents time- and memory-efficientlyโ62Mar 13, 2025Updated last year
- Self-contained Japanese Morphological Analyzer written in pure Goโ961Apr 3, 2026Updated 2 weeks ago
- A Japanese Tokenizer for Businessโ956Apr 13, 2026Updated last week
- Japanese synonym libraryโ55Feb 7, 2022Updated 4 years ago
- ๐ฟ An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.โ261Updated this week
- Managed hosting for WordPress and PHP on Cloudways โข AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ใใผใทใ2ใกใใใญใใใฏใญใผใซใใฆไฝๆใใๅฏพ่ฉฑใณใผใในโ99Jun 6, 2021Updated 4 years ago
- An integrated Japanese analyzer based on foundation modelsโ142Apr 6, 2026Updated last week
- japanese sentence segmentation library for pythonโ74Apr 3, 2023Updated 3 years ago
- Wikipediaใ็จใใๆฅๆฌ่ชใฎๅบๆ่กจ็พๆฝๅบใใผใฟใปใใโ143Sep 2, 2023Updated 2 years ago
- A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.โ518Oct 24, 2025Updated 5 months ago
- Fast match expression optimized for string comparisonโ41Jan 29, 2024Updated 2 years ago
- AJIMEE-Bench (Advanced Japanese IME Evaluation Benchmark)โ20Jan 13, 2025Updated last year
- A Japanese NLP Library using spaCy as framework based on Universal Dependenciesโ844Mar 30, 2024Updated 2 years ago
- Pytorch implementation and pre-trained Japanese model for CANINE, the efficient character-level transformer.โ89Nov 3, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial โข AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Lindera tokenizer for Tantivy.โ70Jan 11, 2026Updated 3 months ago
- pyopenjtalk-plus: A Python wrapper for OpenJTalk with additional improvementsโ57Mar 30, 2026Updated 2 weeks ago
- Yet another Japanese IME for IBus/Linuxโ250Updated this week
- Japanese tokenizer for Transformersโ79Dec 15, 2023Updated 2 years ago
- Neologism dictionary based on the language resources on the Web for mecab-ipadicโ2,788Dec 27, 2023Updated 2 years ago
- Evidence-based Explanation Dataset (AACL-IJCNLP 2020)โ18Dec 17, 2020Updated 5 years ago
- โ100Jul 23, 2023Updated 2 years ago