A project for clustering text streams using locality-sensitive hashing (LSH) in Python
☆26Sep 23, 2011Updated 14 years ago
Alternatives and similar repositories for streaming_lsh
Users that are interested in streaming_lsh are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A platform for collecting, analyzing, and visualizing social media data.☆13Dec 27, 2020Updated 5 years ago
- Tweets annotated with coarse-grained sense labels (supersenses)☆13Jun 13, 2014Updated 11 years ago
- A friendlier interface to `socket`.☆14Apr 11, 2015Updated 11 years ago
- Stability analysis for topic models☆52Oct 16, 2016Updated 9 years ago
- The Community-enRiched Open WordNet (CROWN)☆18Dec 3, 2015Updated 10 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Text Comprehension Engine in Python☆15Aug 23, 2015Updated 10 years ago
- A tool for semantic relation extraction. The program finds pairs of semantically related words based on the text definitions coming from …☆26Jul 1, 2014Updated 11 years ago
- Slinky, a high-performance web crawler / text analytics in Python, Redis, Hadoop, R, Gephi☆40Aug 30, 2010Updated 15 years ago
- A streaming cross-cat inference engine☆20Mar 27, 2024Updated 2 years ago
- A ROS1/ROS2 compatible, RDFlib-backed knowledge base for robotic application. Mostly KB-API conformant.☆16Apr 2, 2026Updated last month
- Topic Model or LDA in Cython☆21Apr 9, 2011Updated 15 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Sep 30, 2016Updated 9 years ago
- Code implementing the experiments described in the NeurIPS 2018 paper "With Friends Like These, Who Needs Adversaries?".☆13Sep 11, 2020Updated 5 years ago
- Distributed text analysis suite based on Celery☆96Dec 15, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Statistical Natural Language Processing with Annotated Suffix Trees☆22Jul 22, 2016Updated 9 years ago
- Set up MIT's CLIFF geolocation service with Vagrant☆17May 5, 2015Updated 11 years ago
- A simple and fast search engine☆70Jun 21, 2022Updated 3 years ago
- Alembic extension that adds support for arbitrary user-defined objects like views or functions in autogenerate command.☆13Feb 6, 2025Updated last year
- JavaScript port of lmfit☆15Jan 13, 2023Updated 3 years ago
- A DSL to build Lucene text queries in Python.☆38Jan 5, 2017Updated 9 years ago
- Python Approximate Nearest Neighbor Search in very high dimensional spaces with optimised indexing.☆219Oct 7, 2021Updated 4 years ago
- ☆17Jul 21, 2023Updated 2 years ago
- Transform unstructured document collections to structured Linked Data☆29Sep 12, 2025Updated 8 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [TMLR] Unsupervised Network Embedding Beyond Homophily (https://arxiv.org/abs/2203.10866) Resources☆11Mar 21, 2023Updated 3 years ago
- Python API for Various DB-Backed Simhash Clusters☆64Mar 16, 2017Updated 9 years ago
- A simple solution for organizing your FastAPI endpoints☆14Jan 31, 2023Updated 3 years ago
- A Docker image for the CLIFF geolocation software.☆10Jun 12, 2018Updated 7 years ago
- stav text annotation visualiser☆34Nov 2, 2011Updated 14 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing☆207Nov 9, 2022Updated 3 years ago
- A service for allowing subreddits to publish their moderator logs☆14Jan 25, 2023Updated 3 years ago
- Probabilistic structure discovery for rich relational systems☆14Jul 9, 2024Updated last year
- An agent-based modeling toolkit written in Python.☆39Mar 18, 2014Updated 12 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Knowledge extraction from web data☆92May 7, 2018Updated 8 years ago
- ☆11Oct 14, 2015Updated 10 years ago
- A python library to find differences between audio and transcriptions☆20Nov 14, 2023Updated 2 years ago
- Shared memory based Hash Table extension for Python☆45Nov 9, 2021Updated 4 years ago
- Sunburnt offspring solr client☆27Mar 21, 2022Updated 4 years ago
- collection of modules to build distributed and reliable concurrent systems in Python.☆207Sep 14, 2013Updated 12 years ago
- Detection of microblogs novel events using an online variant of topic model☆72May 6, 2013Updated 13 years ago