A project for clustering text streams using locality-sensitive hashing (LSH) in Python
☆26Sep 23, 2011Updated 14 years ago
Alternatives and similar repositories for streaming_lsh
Users that are interested in streaming_lsh are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A platform for collecting, analyzing, and visualizing social media data.☆13Dec 27, 2020Updated 5 years ago
- How to spot first stories on Twitter using Storm.☆124Dec 17, 2023Updated 2 years ago
- Stability analysis for topic models☆52Oct 16, 2016Updated 9 years ago
- The Community-enRiched Open WordNet (CROWN)☆18Dec 3, 2015Updated 10 years ago
- A Text Comprehension Engine in Python☆15Aug 23, 2015Updated 10 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Slinky, a high-performance web crawler / text analytics in Python, Redis, Hadoop, R, Gephi☆41Aug 30, 2010Updated 15 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆12Sep 12, 2016Updated 9 years ago
- A streaming cross-cat inference engine☆20Mar 27, 2024Updated last year
- Topic Model or LDA in Cython☆21Apr 9, 2011Updated 14 years ago
- Code implementing the experiments described in the NeurIPS 2018 paper "With Friends Like These, Who Needs Adversaries?".☆13Sep 11, 2020Updated 5 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Sep 30, 2016Updated 9 years ago
- Statistical Natural Language Processing with Annotated Suffix Trees☆22Jul 22, 2016Updated 9 years ago
- Set up MIT's CLIFF geolocation service with Vagrant☆17May 5, 2015Updated 10 years ago
- A simple and fast search engine☆70Jun 21, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A DSL to build Lucene text queries in Python.☆38Jan 5, 2017Updated 9 years ago
- OpenBlock is a web application and RESTful service that allows users to browse and search their local area for "hyper-local news☆61Jun 10, 2021Updated 4 years ago
- Transform unstructured document collections to structured Linked Data☆29Sep 12, 2025Updated 6 months ago
- A framework for large-scale feature extraction, indexing and retrieval.☆60Mar 4, 2016Updated 10 years ago
- A fast Python implementation of locality sensitive hashing.☆71Mar 13, 2015Updated 11 years ago
- Python API for Various DB-Backed Simhash Clusters☆64Mar 16, 2017Updated 9 years ago
- A simple solution for organizing your FastAPI endpoints☆14Jan 31, 2023Updated 3 years ago
- A Docker image for the CLIFF geolocation software.☆10Jun 12, 2018Updated 7 years ago
- stav text annotation visualiser☆34Nov 2, 2011Updated 14 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- nanoservice is a small Python library for writing lightweight networked services using nanomsg☆31Dec 29, 2015Updated 10 years ago
- Recreating react-suspense features using elm☆15Oct 28, 2018Updated 7 years ago
- Knowledge extraction from web data☆92May 7, 2018Updated 7 years ago
- An agent-based modeling toolkit written in Python.☆39Mar 18, 2014Updated 12 years ago
- Sunburnt offspring solr client☆27Mar 21, 2022Updated 4 years ago
- collection of modules to build distributed and reliable concurrent systems in Python.☆206Sep 14, 2013Updated 12 years ago
- Locality-sensitive hashing algorithm for text similarity comparisons☆58Apr 9, 2025Updated 11 months ago
- Elm bindings to the "Sign in With Google" widget☆11Jan 14, 2023Updated 3 years ago
- TaxTea 🐸 ☕️ - Django app that calculates tax rates for SaaS products☆19Jul 2, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Provides syntax highlighting for Apptainer/Singularity definition files.☆10Dec 24, 2025Updated 3 months ago
- Using Shodan to get a breakdown of the most common key names in public Redis servers.☆13Dec 10, 2017Updated 8 years ago
- WordNet Domains, WordNet Affect and SentiWords☆49Jan 8, 2016Updated 10 years ago
- The Tweets2013 Internet Archive collection☆10Aug 7, 2020Updated 5 years ago
- Natural language hashing library.☆10Nov 24, 2014Updated 11 years ago
- Python library for creating word clouds from text☆51Jun 4, 2019Updated 6 years ago
- webstore is a web-api enabled datastore backed onto sql databases especially sqlite. It supports the RESTful JSON APIs standard to nosql …☆40Oct 18, 2019Updated 6 years ago