andrewclegg / sketchy
Simple approximate-nearest-neighbours in Python using locality sensitive hashing.
☆141Updated 13 years ago
Alternatives and similar repositories for sketchy
Users who are interested in sketchy compare it to the libraries listed below.
- A Topic Modeling toolbox☆92Updated 9 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 3 years ago
- Python forecasting and smoothing library☆67Updated 6 years ago
- Algorithm's team Jupyter Notebooks☆113Updated 6 months ago
- Topic modeling web application☆40Updated 10 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 9 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyData Paris 2015☆50Updated 10 years ago
- Tools, wrappers, etc. for data science with a concentration on text processing☆207Updated 3 years ago
- Low-level primitives for collapsed Gibbs sampling in Python and C++☆33Updated last year
- Experimental parallel data analysis toolkit.☆122Updated 4 years ago
- Latent Dirichlet Allocation for topic modeling of streamed data sources☆101Updated 10 years ago
- ☆81Updated 9 years ago
- Latent Dirichlet allocation (LDA) for datamicroscopes☆41Updated 10 years ago
- Scripts to Analyze Pronto's Data Release☆24Updated 10 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆52Updated 8 years ago
- A set of distinct-value estimators that give probabilistic bounds on a set's cardinality☆22Updated 5 years ago
- A fast Python implementation of locality sensitive hashing.☆71Updated 10 years ago
- PyTennessee 2014: Statistical Data Analysis in Python☆85Updated 11 years ago
- Modeling Social Data, Applied Mathematics, Columbia University (Spring 2015)☆33Updated 6 years ago
- Collection of dask example notebooks☆57Updated 7 years ago
- Compute association strength over semantic networks in a dimensionality-reduced form.☆32Updated 10 years ago
- A Python wrapper for MADlib (http://madlib.net), an open source library for scalable in-database machine learning algorithms☆63Updated 5 years ago
- A PyData 2013 talk on straightforward, data-driven ways to handle natural language text in Python.☆51Updated 11 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆98Updated 10 years ago
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]☆107Updated 12 years ago
- Fast, easy and intuitive machine learning prototyping.☆124Updated 11 years ago
- Concept discovery and recommendation library built on top of the IBM Watson cognitive API.☆24Updated 9 years ago
- ☆21Updated 9 years ago
- A library that allows serialization of SciKit-Learn estimators into PMML☆72Updated 6 years ago
- Refinery - A locally deployable open-source web platform for analysis of large document collections☆101Updated 9 years ago