andrewclegg / sketchy
Simple approximate-nearest-neighbours in Python using locality sensitive hashing.
☆140Updated 12 years ago
Alternatives and similar repositories for sketchy:
Users that are interested in sketchy are comparing it to the libraries listed below
- A Topic Modeling toolbox☆92Updated 8 years ago
- Python forecasting and smoothing library☆67Updated 5 years ago
- Fast Vector Operations on Pretty Big Data☆13Updated 9 years ago
- Scripts to Analyze Pronto's Data Release☆24Updated 9 years ago
- Tool to visualize data quickly with no brain usage for plot creation☆46Updated 5 years ago
- workflow support for reproducible deduplication and merging☆16Updated last year
- scikit-learn addon to operate on set/"group"-based features☆41Updated 8 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆82Updated 8 years ago
- Creates models to classify documents into categories☆66Updated 7 years ago
- This repository is not maintained anymore. ConfusionMatrix is now part of pandas-ml☆19Updated 8 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyPata Paris 2015☆50Updated 10 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆99Updated 9 years ago
- Implementation of Bayesian Sets for fast similarity searches.☆14Updated 13 years ago
- Fast Dot Products on Pretty Big Data☆15Updated 6 years ago
- Python (PyMC) adaptation of the R code from "Doing Bayesian Data Analysis"☆64Updated 7 years ago
- Demo code for learning_text_transformer☆25Updated 10 years ago
- Realtime semantic similarity visualization with gensim, d3.js, and hookbox☆40Updated 11 years ago
- A set of distinct value estimators that give probabilistic bounds on a sets cardinality☆22Updated 5 years ago
- ☆34Updated 8 years ago
- spy on your random forests☆19Updated 4 years ago
- Compute association strength over semantic networks in a dimensionality-reduced form.☆32Updated 9 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- A Python library for dealing with splittable files☆42Updated 5 years ago
- Statistical Dependency Parser using SVM as proposed by Yamada et al☆29Updated 9 years ago
- Low-level primitives for collapsed Gibbs sampling in python and C++☆33Updated last year
- An implementation of the multi-armed bandit optimization pattern as a Flask extension☆81Updated this week
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 10 years ago
- ☆21Updated 9 years ago
- Common post-estimation tasks for scikit-learn☆17Updated 8 years ago
- Data science tools from Moz☆22Updated 8 years ago