mmihaltz / pysettrie
python3 package supporting efficient storage and querying of sets of sets using the trie data structure. Supports finding all the supersets/subsets of a given set from a collection of sets. Also includes a trie-based mapping container where the keys are sets.
☆24Updated last year
Related projects ⓘ
Alternatives and complementary repositories for pysettrie
- An index data structure for approximate string search.☆23Updated 5 years ago
- Scalable String Similarity Joins in Python☆39Updated 4 months ago
- Yet another regression toolkit☆12Updated 11 years ago
- A disk-based key/value store in Python with no dependencies.☆21Updated 9 years ago
- Hidden alignment conditional random field for classifying string pairs.☆25Updated 2 months ago
- Dask and Spark interactions☆21Updated 7 years ago
- Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method☆13Updated 11 months ago
- Python 3 implementation and documentation of the Hermina-Janos local graph clustering algorithm.☆21Updated last year
- deep entity resolution lite version☆11Updated 5 years ago
- A Cython implementation of the affine gap string distance☆58Updated last year
- Set-oriented Operations in Pandas☆24Updated 4 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 8 years ago
- This is an Object Oriented implementation of a Trie in python. The class contains setter and getter methods, and implements several usefu…☆14Updated 6 years ago
- ☆24Updated 6 years ago
- A project for clustering text streams using locality-sensitive hashing (LSH) in Python☆27Updated 13 years ago
- Algorithms for "schema matching"☆25Updated 8 years ago
- ☆29Updated 2 years ago
- Load embeddings and featurize your sentences.☆28Updated last month
- Python search module for fast approximate string matching☆53Updated last year
- Ranking Entity Types using the Web of Data☆30Updated 8 years ago
- A DeepWalk implementation for ontologies using NetworkX and Gensim☆19Updated 7 years ago
- Python library for declarative, constrained, structured-output prediction.☆21Updated last year
- Python bindings to Succinct Data Structure Library 2.0☆30Updated 5 years ago
- Formal concept analysis lattice generation and query in Python☆13Updated 10 years ago
- Official repository of Quickscorer: a fast algorithm to rank documents with additive ensembles of regression trees.☆18Updated 8 years ago
- Some convenient natural language tools that build on NLTK.☆85Updated 10 years ago
- [hibernating] Dynamic topic models☆39Updated 9 years ago
- Pandas Msgpack☆23Updated 2 years ago
- brat rapid annotation tool (brat) - for all your textual annotation needs☆10Updated 6 years ago