Static memory-efficient Trie-like structures for Python based on marisa-trie C++ library.
☆1,124Apr 8, 2026Updated last month
Alternatives and similar repositories for marisa-trie
Users that are interested in marisa-trie are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library.☆306Jun 11, 2024Updated last year
- Fast, efficiently stored Trie for Python. Uses libdatrie.☆547Jan 6, 2026Updated 4 months ago
- MARISA: Matching Algorithm with Recursively Implemented StorAge☆611Mar 17, 2026Updated last month
- Pure-python reader for DAWGs created by dawgdic C++ library or DAWG Python extension.☆50Sep 11, 2023Updated 2 years ago
- Python library implementing a trie data structure.☆820Apr 10, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- HAT-Trie for Python☆87Feb 8, 2016Updated 10 years ago
- Python module (C extension and plain python) implementing Aho-Corasick algorithm☆1,100Apr 27, 2026Updated last week
- Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.☆1,486Dec 7, 2022Updated 3 years ago
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk☆14,232Oct 29, 2025Updated 6 months ago
- Library for fast text representation and classification.☆26,521Mar 22, 2024Updated 2 years ago
- A clone of Darts (Double-ARray Trie System)☆161May 14, 2025Updated 11 months ago
- Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-me…☆3,582Apr 13, 2026Updated 3 weeks ago
- Python package for lexicon; Trie and DAWG implementation.☆56Updated this week
- Extract Keywords from sentence or Replace keywords in sentences.☆5,711Apr 13, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- KenLM: Faster and Smaller Language Model Queries☆2,770Mar 30, 2025Updated last year
- Python extension module for accelerating regular expressions using libesm☆132Oct 4, 2023Updated 2 years ago
- Learning embeddings for classification, retrieval and ranking.☆3,955Dec 4, 2022Updated 3 years ago
- MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW☆2,917Apr 18, 2026Updated 3 weeks ago
- Unsupervised text tokenizer for Neural Network-based text generation.☆11,803Updated this week
- Fast multi-keyword search engine for text strings☆258Sep 14, 2024Updated last year
- Python search module for fast approximate string matching☆54Jan 25, 2023Updated 3 years ago
- Quality information extraction at web scale.☆466Dec 27, 2018Updated 7 years ago
- Scalable, fast, and lightweight system for large-scale topic modeling☆844Dec 28, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 💫 Industrial-strength Natural Language Processing (NLP) in Python☆33,544Mar 28, 2026Updated last month
- A tool for extracting plain text from Wikipedia dumps☆3,985May 23, 2024Updated last year
- Topic Modelling for Humans☆16,403Nov 1, 2025Updated 6 months ago
- Fuzzy String Matching in Python☆9,259Feb 24, 2023Updated 3 years ago
- Ultra fast asyncio event loop.☆11,779May 2, 2026Updated last week
- An efficient trie implementation.☆254Nov 25, 2020Updated 5 years ago
- Code for the paper "DeepType: Multilingual Entity Linking by Neural Type System Evolution"☆656Apr 2, 2023Updated 3 years ago
- Cython implementation of Toolz: High performance functional utilities☆1,107Dec 1, 2025Updated 5 months ago
- Facilitating the design, comparison and sharing of deep text matching models.☆3,849Aug 2, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- GloVe Word Embedding model's implementation in theano☆36May 18, 2017Updated 8 years ago
- A library for efficient similarity search and clustering of dense vectors.☆39,918May 2, 2026Updated last week
- LSTM language model with CNN over characters☆836Aug 24, 2016Updated 9 years ago
- Sent2Vec encoder and training code from the paper "Skip-Thought Vectors"☆2,051Jun 9, 2020Updated 5 years ago
- C++ implementation of a fast and memory efficient HAT-trie☆865Nov 11, 2025Updated 5 months ago
- A Toolkit for Industrial Topic Modeling☆2,646Jul 1, 2021Updated 4 years ago
- 🦆 Contextually-keyed word vectors☆1,673Mar 27, 2026Updated last month