andrewclegg / sketchyLinks
Simple approximate-nearest-neighbours in Python using locality sensitive hashing.
☆140Updated 13 years ago
Alternatives and similar repositories for sketchy
Users that are interested in sketchy are comparing it to the libraries listed below
Sorting:
- A Topic Modeling toolbox☆92Updated 9 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 3 years ago
- Low-level primitives for collapsed Gibbs sampling in python and C++☆33Updated last year
- Latent Dirichlet Allocation for topic modeling of streamed data sources☆100Updated 10 years ago
- Python forecasting and smoothing library☆67Updated 6 years ago
- Experimental parallel data analysis toolkit.☆121Updated 3 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyPata Paris 2015☆50Updated 10 years ago
- Topic modeling web application☆41Updated 9 years ago
- Algorithm's team Jupyter Notebooks☆113Updated last month
- Compute association strength over semantic networks in a dimensionality-reduced form.☆32Updated 9 years ago
- Refinery - A locally deployable open-source web platform for analysis of large document collections☆101Updated 8 years ago
- Collection of dask example notebooks☆58Updated 7 years ago
- Latent dirichlet allocation (LDA) for datamicroscopes☆41Updated 9 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- ☆81Updated 9 years ago
- workflow support for reproducible deduplication and merging☆16Updated 2 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 8 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 10 years ago
- An implementation of gibbs sampling for Latent Dirichlet Allocation☆30Updated 13 years ago
- Scikit learn inspired library for gpu-accelerated machine learning☆38Updated 2 years ago
- Collection of pointers to slides and repositories from speakers at PyData Berlin 2016☆37Updated 9 years ago
- Predicting closed questions on Stack Overflow☆44Updated 7 years ago
- Jeremy's Machine Learning Library☆32Updated 7 years ago
- A set of distinct value estimators that give probabilistic bounds on a sets cardinality☆22Updated 5 years ago
- Fast Vector Operations on Pretty Big Data☆13Updated 9 years ago
- A streaming cross-cat inference engine☆20Updated last year
- A disk-based key/value store in Python with no dependencies.☆21Updated 10 years ago
- A Bayesian testing framework written in Python.☆94Updated 10 years ago
- IPython notebook storage on OpenStack clouds☆58Updated 6 years ago
- Python implementation of Markov Networks for neural computing.☆38Updated 4 months ago