ogrisel / spylearn
Repo for experiments on pyspark and sklearn
☆79Updated 11 years ago
Alternatives and similar repositories for spylearn:
Users who are interested in spylearn are comparing it to the libraries listed below:
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyData Paris 2015☆50Updated 10 years ago
- Machine Learning with Scikit-Learn (material for PyData Amsterdam 2016)☆30Updated 9 years ago
- Scikit-learn Tutorial at EuroPython 2014☆43Updated 6 years ago
- Quick & dirty repo for hosting the Notebook for the t-SNE presentation delivered at the Python Quants and PyData London meetups☆9Updated 9 years ago
- A Bayesian testing framework written in Python.☆94Updated 10 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 8 years ago
- Mirror of Apache Spark☆24Updated 9 years ago
- A simple example of containerized data science with Python and Docker.☆51Updated 7 years ago
- This repo contains the exercises from the Next.ML 2015 presentation☆24Updated 10 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.☆69Updated 10 years ago
- Machine learning evaluation database☆24Updated 7 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 8 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 8 years ago
- Latent Dirichlet allocation (LDA) for datamicroscopes☆40Updated 9 years ago
- Material for open source machine learning practical☆21Updated 9 years ago
- ☆58Updated 9 years ago
- Scikit-learn quickstart tutorial for Webstep☆19Updated 8 years ago
- Second-ranked solution to the Kaggle "Flavours of Physics" competition☆25Updated 9 years ago
- ☆25Updated 9 years ago
- ☆36Updated 9 years ago
- Large scale matrix factorization on GPU☆19Updated 8 years ago
- Predicting closed questions on Stack Overflow☆44Updated 7 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 9 years ago
- Run Nx2 Cross Validation for multiple binary classifiers in parallel with optional downsampling☆13Updated 10 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- Creates models to classify documents into categories☆66Updated 7 years ago
- Jupyter notebooks and code for Intro to DL talk at Genesys☆14Updated 8 years ago
- Deep learning for hackers: a hands-on approach to machine learning and deep learning.☆68Updated 10 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit☆24Updated 10 years ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation☆45Updated 8 years ago