Locality-sensitive hashing algorithm for text similarity comparisons
☆58Apr 9, 2025Updated 11 months ago
Alternatives and similar repositories for py-nilsimsa
Users that are interested in py-nilsimsa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Extends zip() and itertools.zip_longest() to generate named tuples.☆22May 13, 2019Updated 6 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Sep 30, 2016Updated 9 years ago
- A project for clustering text streams using locality-sensitive hashing (LSH) in Python☆26Sep 23, 2011Updated 14 years ago
- Genyris presents a paradigm in which objects can belong to multiple classes independent from construction allowing data to be classified …☆17Updated this week
- Extract Icon from PE Executable using Python☆26Jul 2, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Python-based cloud node for local use☆11Mar 7, 2018Updated 8 years ago
- AI examples with TensorFlow, Keras☆16Feb 8, 2017Updated 9 years ago
- A library for Time-Series exploration, analysis & modelling.☆17Dec 10, 2020Updated 5 years ago
- Comparison of bandit algorithms from the Reinforcement Learning bible.☆17Jun 6, 2018Updated 7 years ago
- ☆15Dec 26, 2021Updated 4 years ago
- ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data.☆34Jul 26, 2020Updated 5 years ago
- An autoscaling python script for Heroku☆28May 16, 2012Updated 13 years ago
- Distributed Web Crawler, Parser and Search Engine.☆10Jun 16, 2016Updated 9 years ago
- stav text annotation visualiser☆34Nov 2, 2011Updated 14 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Search with a hash index☆33Mar 27, 2019Updated 6 years ago
- Building Event Extraction and Trending Framework for Twitter☆14Sep 13, 2017Updated 8 years ago
- A friendly pandas wrapper with a more composable grammar support.☆14Mar 7, 2017Updated 9 years ago
- Official repository of "Efficient and Effective Query Expansion for Web Search", Short Paper @ CIKM 2018☆15Nov 17, 2019Updated 6 years ago
- Shared memory based Hash Table extension for Python☆45Nov 9, 2021Updated 4 years ago
- Ghidra consonance and make it more ida-ish☆16Mar 11, 2019Updated 7 years ago
- A Text Comprehension Engine in Python☆15Aug 23, 2015Updated 10 years ago
- ☆18Jun 12, 2023Updated 2 years ago
- Learning M-Way Tree - Web Scale Clustering - EM-tree, K-tree, k-means, TSVQ, repeated k-means, bitwise clustering☆78Feb 7, 2022Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Funnel plot☆45Apr 11, 2023Updated 2 years ago
- Getting more out of Django!☆35Nov 14, 2019Updated 6 years ago
- Data and code for the experiments in the Outlier Detection task proposed by Camacho-Collados et al.☆13Aug 28, 2018Updated 7 years ago
- Slinky, a high-performance web crawler / text analytics in Python, Redis, Hadoop, R, Gephi☆41Aug 30, 2010Updated 15 years ago
- Very accessible code for my MSc thesis. Inexpensive quantization method for ANN search also known as Enhanced Residual VQ.☆14Jun 15, 2020Updated 5 years ago
- IoC's, PCRE's, YARA's etc☆23Mar 25, 2025Updated last year
- A python module provides content extraction and summarization of a web page even if the web page was broken.☆18Apr 14, 2023Updated 2 years ago
- Assorted tools and utility functions, mainly for doing NLP with Python☆23Sep 12, 2025Updated 6 months ago
- Co-reference resolution for the English language.☆17Jan 12, 2015Updated 11 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Python framework for fast (approximated) nearest neighbour search in large, high-dimensional data sets using different locality-sensitive…☆771Feb 23, 2023Updated 3 years ago
- Nextcloud App that shows File-Upload Details in Progress Bar☆12Feb 15, 2020Updated 6 years ago
- Calculates the probability of a haplotype given a population reference panel☆12Dec 9, 2024Updated last year
- A ROS1/ROS2 compatible, RDFlib-backed knowledge base for robotic application. Mostly KB-API conformant.☆16Sep 12, 2025Updated 6 months ago
- Code for Fast Information-theoretic Bayesian Optimisation☆16Jun 7, 2018Updated 7 years ago
- Apache Nutch extensions☆34Mar 21, 2022Updated 4 years ago
- TextFlows is an open-source online platform for composition, execution, and sharing of interactive text mining and natural language proce…☆19Dec 1, 2017Updated 8 years ago