exhuma / python-cluster
Simple clustering library for python.
☆65Updated 4 years ago
Alternatives and similar repositories for python-cluster:
Users that are interested in python-cluster are comparing it to the libraries listed below
- A fast Python implementation of locality sensitive hashing.☆70Updated 10 years ago
- Simple approximate-nearest-neighbours in Python using locality sensitive hashing.☆140Updated 12 years ago
- A Topic Modeling toolbox☆92Updated 8 years ago
- unofficial git mirror of http://svn.whoosh.ca svn repo☆49Updated 15 years ago
- Collection of dask example notebooks☆58Updated 7 years ago
- A pure python implementation of locality sensitive hashing for text documents☆85Updated 9 years ago
- Experimental parallel data analysis toolkit.☆121Updated 3 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing☆206Updated 2 years ago
- Tool to visualize data quickly with no brain usage for plot creation☆46Updated 6 years ago
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages.☆63Updated 4 years ago
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]☆108Updated 11 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆82Updated 8 years ago
- Crab is a flexible, fast recommender engine for Python that integrates classic information filtering recommendation algorithms in the world…☆127Updated 11 years ago
- Python Logging for Humans☆119Updated 8 years ago
- High Level Kafka Scanner☆19Updated 7 years ago
- HAT-Trie for Python☆86Updated 9 years ago
- ☆18Updated 8 years ago
- A high-performance distributed web crawling & scraping framework written with golang and python.☆30Updated 8 years ago
- All the Harry Potter clusters you could ever want☆33Updated 9 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Updated 8 years ago
- A benchmark framework for testing algorithms and pairwise metrics.☆67Updated 12 years ago
- Updates to Zope's keyphrase extractor (forked from 1.1.0)☆67Updated 7 years ago
- Material for talk "Machine Learning 101" https://speakerdeck.com/kastnerkyle/pycon2015 https://us.pycon.org/2015/schedule/presentation/36…☆87Updated 10 years ago
- Estimating how similar are two sets using MinHash (Jaccard similarity coefficient)☆30Updated 12 years ago
- Wabbit Wappa is a full-featured Python wrapper for the Vowpal Wabbit machine learning utility.☆101Updated 7 years ago
- [UNMAINTAINED] Firefox addon for Scrapely☆5Updated 9 years ago
- Recommender Systems in Depth: An introduction to Recommender Systems using Python and Crab☆44Updated 11 years ago
- Data analysis tool.☆85Updated 2 years ago
- Data science tools from Moz☆22Updated 8 years ago
- POC IDS anomaly detection engine built with iPython notebook, matplotlib, pandas, numpy, scikit-learn, d3.js, hyperloglog implementation,…☆79Updated 10 years ago