๐ค vibrato: Viterbi-based accelerated tokenizer
โ404Feb 7, 2026Updated last month
Alternatives and similar repositories for vibrato
Users that are interested in vibrato are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ๐ฅ Vaporetto: Very accelerated pointwise prediction based tokenizerโ254Feb 7, 2026Updated last month
- Viterbi-based accelerated tokenizer (Python wrapper)โ43Sep 4, 2024Updated last year
- A multilingual morphological analysis library.โ610Updated this week
- Sudachi in Rust ๐ฆ and new generation of SudachiPyโ434Mar 19, 2026Updated last week
- Rust implementation of SIF and uSIF: Simple and fast sentence embeddingโ19Jan 22, 2025Updated last year
- Managed Database hosting by DigitalOcean โข AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Japanese Morphological Analyzer written in Rustโ109Feb 25, 2026Updated last month
- A tool for visualizing the internal structures of morphological analyzer Sudachiโ18Jun 9, 2022Updated 3 years ago
- Japanese text preprocessor for Text-to-Speech applications (OpenJTalk rewrite in rust language)โ53Updated this week
- Japanese Morphological Analysis written in Rustโ83Dec 30, 2021Updated 4 years ago
- ๐ A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure in Rust.โ245Jan 26, 2026Updated 2 months ago
- Sentence boundary disambiguation tool for Japanese texts (ๆฅๆฌ่ชๆๅข็ๅคๅฎๅจ)โ199Mar 26, 2024Updated 2 years ago
- Yada is a yet another double-array trie library aiming for fast search and compact data representation.โ47Feb 25, 2024Updated 2 years ago
- โ1,595Mar 19, 2026Updated last week
- A lexicon for Sudachiโ283Jan 20, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI โข AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Python linting made easy. Also a casual yet honorific way to address individuals who have entered an organization prior to you.โ493Dec 22, 2025Updated 3 months ago
- ๐ฆ Rust library of natural language dictionaries using character-wise double-array tries.โ37Jan 13, 2025Updated last year
- ๐ฅ Vaporetto is a fast and lightweight pointwise prediction based tokenizer. This is a Python wrapper for Vaporetto.โ21Jun 1, 2025Updated 9 months ago
- Python binding for Jagger(C++ implementation of Pattern-based Japanese Morphological Analyzer)โ13Dec 16, 2025Updated 3 months ago
- Finding all pairs of similar documents time- and memory-efficientlyโ62Mar 13, 2025Updated last year
- Self-contained Japanese Morphological Analyzer written in pure Goโ956Mar 17, 2026Updated last week
- A Japanese Tokenizer for Businessโ951Jun 17, 2025Updated 9 months ago
- Japanese synonym libraryโ55Feb 7, 2022Updated 4 years ago
- ๐ฟ An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.โ261Mar 1, 2026Updated 3 weeks ago
- Proton VPN Special Offer - Get 70% off โข AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ใใผใทใ2ใกใใใญใใใฏใญใผใซใใฆไฝๆใใๅฏพ่ฉฑใณใผใในโ99Jun 6, 2021Updated 4 years ago
- An integrated Japanese analyzer based on foundation modelsโ139Mar 2, 2026Updated 3 weeks ago
- japanese sentence segmentation library for pythonโ74Apr 3, 2023Updated 2 years ago
- Wikipediaใ็จใใๆฅๆฌ่ชใฎๅบๆ่กจ็พๆฝๅบใใผใฟใปใใโ142Sep 2, 2023Updated 2 years ago
- A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.โ516Oct 24, 2025Updated 5 months ago
- Fast match expression optimized for string comparisonโ41Jan 29, 2024Updated 2 years ago
- AJIMEE-Bench (Advanced Japanese IME Evaluation Benchmark)โ20Jan 13, 2025Updated last year
- A Japanese NLP Library using spaCy as framework based on Universal Dependenciesโ838Mar 30, 2024Updated 2 years ago
- Pytorch implementation and pre-trained Japanese model for CANINE, the efficient character-level transformer.โ89Nov 3, 2023Updated 2 years ago
- NordVPN Special Discount Offer โข AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Lindera tokenizer for Tantivy.โ68Jan 11, 2026Updated 2 months ago
- pyopenjtalk-plus: A Python wrapper for OpenJTalk with additional improvementsโ56Mar 22, 2026Updated last week
- Yet another Japanese IME for IBus/Linuxโ247Mar 22, 2026Updated last week
- Japanese tokenizer for Transformersโ79Dec 15, 2023Updated 2 years ago
- Evidence-based Explanation Dataset (AACL-IJCNLP 2020)โ18Dec 17, 2020Updated 5 years ago
- Neologism dictionary based on the language resources on the Web for mecab-ipadicโ2,783Dec 27, 2023Updated 2 years ago
- โ100Jul 23, 2023Updated 2 years ago