Static, memory-efficient trie-like structures for Python, based on the marisa-trie C++ library.
☆1,124 · Apr 8, 2026 · Updated last week
Alternatives and similar repositories for marisa-trie
Users who are interested in marisa-trie are comparing it to the libraries listed below.
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library. ☆306 · Jun 11, 2024 · Updated last year
- Fast, efficiently stored Trie for Python. Uses libdatrie. ☆546 · Jan 6, 2026 · Updated 3 months ago
- MARISA: Matching Algorithm with Recursively Implemented StorAge ☆608 · Mar 17, 2026 · Updated last month
- Pure-python reader for DAWGs created by dawgdic C++ library or DAWG Python extension. ☆50 · Sep 11, 2023 · Updated 2 years ago
- Python library implementing a trie data structure. ☆821 · Apr 10, 2021 · Updated 5 years ago
- HAT-Trie for Python ☆87 · Feb 8, 2016 · Updated 10 years ago
- Python module (C extension and plain python) implementing Aho-Corasick algorithm ☆1,097 · Dec 17, 2025 · Updated 4 months ago
- Python binding of cedar (implementation of efficiently-updatable double-array trie) using Cython ☆17 · Mar 1, 2020 · Updated 6 years ago
- Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on. ☆1,485 · Dec 7, 2022 · Updated 3 years ago
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk ☆14,221 · Oct 29, 2025 · Updated 5 months ago
- Library for fast text representation and classification. ☆26,510 · Mar 22, 2024 · Updated 2 years ago
- A clone of Darts (Double-ARray Trie System) ☆161 · May 14, 2025 · Updated 11 months ago
- Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-me… ☆3,578 · Updated this week
- Python package for lexicon; Trie and DAWG implementation. ☆56 · Feb 23, 2026 · Updated last month
- Extract Keywords from sentence or Replace keywords in sentences. ☆5,709 · Apr 13, 2025 · Updated last year
- KenLM: Faster and Smaller Language Model Queries ☆2,762 · Mar 30, 2025 · Updated last year
- Python extension module for accelerating regular expressions using libesm ☆132 · Oct 4, 2023 · Updated 2 years ago
- Learning embeddings for classification, retrieval and ranking. ☆3,957 · Dec 4, 2022 · Updated 3 years ago
- MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW ☆2,904 · Updated this week
- Unsupervised text tokenizer for Neural Network-based text generation. ☆11,765 · Updated this week
- Fast multi-keyword search engine for text strings ☆258 · Sep 14, 2024 · Updated last year
- Python search module for fast approximate string matching ☆54 · Jan 25, 2023 · Updated 3 years ago
- Quality information extraction at web scale. ☆465 · Dec 27, 2018 · Updated 7 years ago
- Scalable, fast, and lightweight system for large-scale topic modeling ☆844 · Dec 28, 2020 · Updated 5 years ago
- 💫 Industrial-strength Natural Language Processing (NLP) in Python ☆33,473 · Mar 28, 2026 · Updated 3 weeks ago
- A tool for extracting plain text from Wikipedia dumps ☆3,976 · May 23, 2024 · Updated last year
- Topic Modelling for Humans ☆16,389 · Nov 1, 2025 · Updated 5 months ago
- Fuzzy String Matching in Python ☆9,259 · Feb 24, 2023 · Updated 3 years ago
- Ultra fast asyncio event loop. ☆11,749 · Jan 30, 2026 · Updated 2 months ago
- An efficient trie implementation. ☆255 · Nov 25, 2020 · Updated 5 years ago
- Code for the paper "DeepType: Multilingual Entity Linking by Neural Type System Evolution" ☆656 · Apr 2, 2023 · Updated 3 years ago
- Cython implementation of Toolz: High performance functional utilities ☆1,107 · Dec 1, 2025 · Updated 4 months ago
- Facilitating the design, comparison and sharing of deep text matching models. ☆3,850 · Aug 2, 2024 · Updated last year
- GloVe Word Embedding model's implementation in theano ☆36 · May 18, 2017 · Updated 8 years ago
- A library for efficient similarity search and clustering of dense vectors. ☆39,720 · Updated this week
- LSTM language model with CNN over characters ☆836 · Aug 24, 2016 · Updated 9 years ago
- Sent2Vec encoder and training code from the paper "Skip-Thought Vectors" ☆2,051 · Jun 9, 2020 · Updated 5 years ago
- C++ implementation of a fast and memory efficient HAT-trie ☆864 · Nov 11, 2025 · Updated 5 months ago
- A Toolkit for Industrial Topic Modeling ☆2,646 · Jul 1, 2021 · Updated 4 years ago