Static memory-efficient Trie-like structures for Python based on marisa-trie C++ library.
☆1,123Mar 10, 2026Updated 2 weeks ago
Alternatives and similar repositories for marisa-trie
Users that are interested in marisa-trie are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library.☆306Jun 11, 2024Updated last year
- Fast, efficiently stored Trie for Python. Uses libdatrie.☆546Jan 6, 2026Updated 2 months ago
- MARISA: Matching Algorithm with Recursively Implemented StorAge☆600Mar 17, 2026Updated last week
- Pure-python reader for DAWGs created by dawgdic C++ library or DAWG Python extension.☆50Sep 11, 2023Updated 2 years ago
- Python library implementing a trie data structure.☆823Apr 10, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- HAT-Trie for Python☆87Feb 8, 2016Updated 10 years ago
- Python module (C extension and plain python) implementing Aho-Corasick algorithm☆1,093Dec 17, 2025Updated 3 months ago
- Python binding of cedar (implementation of efficiently-updatable double-array trie) using Cython☆17Mar 1, 2020Updated 6 years ago
- Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.☆1,484Dec 7, 2022Updated 3 years ago
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk☆14,188Oct 29, 2025Updated 5 months ago
- Library for fast text representation and classification.☆26,514Mar 22, 2024Updated 2 years ago
- A clone of Darts (Double-ARray Trie System)☆160May 14, 2025Updated 10 months ago
- Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-me…☆3,575Jan 12, 2026Updated 2 months ago
- Python package for lexicon; Trie and DAWG implementation.☆56Feb 23, 2026Updated last month
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Extract Keywords from sentence or Replace keywords in sentences.☆5,707Apr 13, 2025Updated 11 months ago
- KenLM: Faster and Smaller Language Model Queries☆2,749Mar 30, 2025Updated 11 months ago
- Python extension module for accelerating regular expressions using libesm☆132Oct 4, 2023Updated 2 years ago
- Learning embeddings for classification, retrieval and ranking.☆3,957Dec 4, 2022Updated 3 years ago
- MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW☆2,892Updated this week
- Unsupervised text tokenizer for Neural Network-based text generation.☆11,716Mar 1, 2026Updated 3 weeks ago
- Fast multi-keyword search engine for text strings☆258Sep 14, 2024Updated last year
- Python search module for fast approximate string matching☆54Jan 25, 2023Updated 3 years ago
- Quality information extraction at web scale.☆464Dec 27, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Scalable, fast, and lightweight system for large-scale topic modeling☆846Dec 28, 2020Updated 5 years ago
- 💫 Industrial-strength Natural Language Processing (NLP) in Python☆33,390Updated this week
- A tool for extracting plain text from Wikipedia dumps☆3,972May 23, 2024Updated last year
- Topic Modelling for Humans☆16,378Nov 1, 2025Updated 4 months ago
- Fuzzy String Matching in Python☆9,262Feb 24, 2023Updated 3 years ago
- Ultra fast asyncio event loop.☆11,719Jan 30, 2026Updated 2 months ago
- An efficient trie implementation.☆255Nov 25, 2020Updated 5 years ago
- Code for the paper "DeepType: Multilingual Entity Linking by Neural Type System Evolution"☆655Apr 2, 2023Updated 2 years ago
- Cython implementation of Toolz: High performance functional utilities☆1,103Dec 1, 2025Updated 3 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Facilitating the design, comparison and sharing of deep text matching models.☆3,853Aug 2, 2024Updated last year
- GloVe Word Embedding model's implementation in theano☆36May 18, 2017Updated 8 years ago
- A library for efficient similarity search and clustering of dense vectors.☆39,484Updated this week
- LSTM language model with CNN over characters☆836Aug 24, 2016Updated 9 years ago
- Sent2Vec encoder and training code from the paper "Skip-Thought Vectors"☆2,051Jun 9, 2020Updated 5 years ago
- C++ implementation of a fast and memory efficient HAT-trie☆859Nov 11, 2025Updated 4 months ago
- A Toolkit for Industrial Topic Modeling☆2,645Jul 1, 2021Updated 4 years ago