andrewclegg / sketchyLinks
Simple approximate-nearest-neighbours in Python using locality sensitive hashing.
☆141Updated 13 years ago
Alternatives and similar repositories for sketchy
Users that are interested in sketchy are comparing it to the libraries listed below
Sorting:
- Python forecasting and smoothing library☆67Updated 6 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- Topic modeling web application☆40Updated 10 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 3 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing☆207Updated 3 years ago
- Latent Dirichlet Allocation for topic modeling of streamed data sources☆101Updated 10 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 9 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyPata Paris 2015☆50Updated 10 years ago
- A fast Python implementation of locality sensitive hashing.☆71Updated 10 years ago
- Algorithm's team Jupyter Notebooks☆113Updated 6 months ago
- PyTennessee 2014: Statistical Data Analysis in Python☆85Updated 11 years ago
- Collection of dask example notebooks☆57Updated 7 years ago
- Python wrapper for the Vowpal Wabbit machine learning library.☆53Updated 12 years ago
- Topological Anomaly Detection (TAD) per Gartley and Basener 2009☆68Updated 5 years ago
- ☆81Updated 9 years ago
- Creates models to classify documents into categories☆66Updated 8 years ago
- Compute association strength over semantic networks in a dimensionality-reduced form.☆32Updated 10 years ago
- Refinery - A locally deployable open-source web platform for analysis of large document collections☆101Updated 9 years ago
- PDF and python files for creating time maps and downloading tweets☆59Updated 5 years ago
- Latent dirichlet allocation (LDA) for datamicroscopes☆41Updated 10 years ago
- Advanced git and github course material☆39Updated 7 years ago
- This repository is not maintained anymore. ConfusionMatrix is now part of pandas-ml☆19Updated 9 years ago
- Experimental parallel data analysis toolkit.☆122Updated 4 years ago
- Scripts to Analyze Pronto's Data Release☆24Updated 10 years ago
- Modeling Social Data, Applied Mathematics, Columbia University (Spring 2015)☆33Updated 6 years ago
- Tool to visualize data quickly with no brain usage for plot creation☆47Updated last month
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- A set of distinct value estimators that give probabilistic bounds on a sets cardinality☆22Updated 6 years ago
- A Python library for learning from dimensionality reduction, supporting sparse and dense matrices.☆78Updated 8 years ago
- An implementation of the multi-armed bandit optimization pattern as a Flask extension☆81Updated last month