A project for clustering text streams using locality-sensitive hashing (LSH) in Python
☆26 · Sep 23, 2011 · Updated 14 years ago
Alternatives and similar repositories for streaming_lsh
Users that are interested in streaming_lsh are comparing it to the libraries listed below.
- A platform for collecting, analyzing, and visualizing social media data. ☆13 · Dec 27, 2020 · Updated 5 years ago
- A friendlier interface to `socket`. ☆14 · Apr 11, 2015 · Updated 11 years ago
- Stability analysis for topic models ☆52 · Oct 16, 2016 · Updated 9 years ago
- The Community-enRiched Open WordNet (CROWN) ☆18 · Dec 3, 2015 · Updated 10 years ago
- A tool for semantic relation extraction. The program finds pairs of semantically related words based on the text definitions coming from … ☆26 · Jul 1, 2014 · Updated 11 years ago
- Data Mining and User Portrait reports, with a score of 95/100, ranking first, including experiments of 3 kaggle data sets and a small pap… ☆10 · Feb 5, 2020 · Updated 6 years ago
- CCKS Ant Financial event subject extraction ☆14 · Jun 13, 2019 · Updated 7 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow ☆12 · Sep 12, 2016 · Updated 9 years ago
- A ROS1/ROS2 compatible, RDFlib-backed knowledge base for robotic application. Mostly KB-API conformant. ☆16 · Apr 2, 2026 · Updated 2 months ago
- Topic Model or LDA in Cython ☆21 · Apr 9, 2011 · Updated 15 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text ☆35 · Sep 30, 2016 · Updated 9 years ago
- Statistical Natural Language Processing with Annotated Suffix Trees ☆22 · Jul 22, 2016 · Updated 9 years ago
- ☆17 · Aug 29, 2019 · Updated 6 years ago
- A simple and fast search engine ☆70 · Jun 21, 2022 · Updated 3 years ago
- Alembic extension that adds support for arbitrary user-defined objects like views or functions in autogenerate command. ☆13 · Feb 6, 2025 · Updated last year
- A DSL to build Lucene text queries in Python. ☆38 · Jan 5, 2017 · Updated 9 years ago
- Python Approximate Nearest Neighbor Search in very high dimensional spaces with optimised indexing. ☆219 · Oct 7, 2021 · Updated 4 years ago
- OpenBlock is a web application and RESTful service that allows users to browse and search their local area for "hyper-local news ☆60 · Jun 10, 2021 · Updated 5 years ago
- Transform unstructured document collections to structured Linked Data ☆29 · Updated this week
- A fast Python implementation of locality sensitive hashing. ☆71 · Mar 13, 2015 · Updated 11 years ago
- Python API for Various DB-Backed Simhash Clusters ☆64 · Mar 16, 2017 · Updated 9 years ago
- A simple solution for organizing your FastAPI endpoints ☆14 · Jan 31, 2023 · Updated 3 years ago
- stav text annotation visualiser ☆34 · Nov 2, 2011 · Updated 14 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing ☆207 · Nov 9, 2022 · Updated 3 years ago
- An agent-based modeling toolkit written in Python. ☆39 · Mar 18, 2014 · Updated 12 years ago
- ☆11 · Oct 14, 2015 · Updated 10 years ago
- Sunburnt offspring solr client ☆27 · Mar 21, 2022 · Updated 4 years ago
- Triple pattern matching over non-RDF datasources with inference ☆16 · Feb 6, 2019 · Updated 7 years ago
- collection of modules to build distributed and reliable concurrent systems in Python. ☆207 · Sep 14, 2013 · Updated 12 years ago
- Locality-sensitive hashing algorithm for text similarity comparisons ☆58 · Apr 9, 2025 · Updated last year
- Elm bindings to the "Sign in With Google" widget ☆11 · Jan 14, 2023 · Updated 3 years ago
- TaxTea 🐸 ☕️ - Django app that calculates tax rates for SaaS products ☆19 · Jul 2, 2024 · Updated last year
- Using Shodan to get a breakdown of the most common key names in public Redis servers. ☆12 · Dec 10, 2017 · Updated 8 years ago
- WordNet Domains, WordNet Affect and SentiWords ☆51 · Jan 8, 2016 · Updated 10 years ago
- Large scale sentential paraphrases collection and annotation ☆46 · Dec 31, 2022 · Updated 3 years ago
- Python library for creating word clouds from text ☆51 · Jun 4, 2019 · Updated 7 years ago
- Natural language hashing library. ☆10 · Nov 24, 2014 · Updated 11 years ago
- webstore is a web-api enabled datastore backed onto sql databases especially sqlite. It supports the RESTful JSON APIs standard to nosql … ☆40 · Oct 18, 2019 · Updated 6 years ago
- Command line tool for manipulating and analyzing text ☆29 · May 27, 2022 · Updated 4 years ago