eleflow / sparknotebook
A fast way of getting a Spark cluster up and running on AWS with the friendly IPython interface.
☆10Updated 9 years ago
Alternatives and similar repositories for sparknotebook:
Users that are interested in sparknotebook are comparing it to the libraries listed below
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- A library that allows serialization of SciKit-Learn estimators into PMML☆70Updated 5 years ago
- Luigi Plugin for Hubot☆35Updated 8 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 3 years ago
- Sample repo for luigi tasks & config☆36Updated 8 years ago
- Machine learning evaluation database☆24Updated 7 years ago
- Additional files for the Otto Group Challenge hosted by Kaggle☆36Updated 9 years ago
- feng - feature engineering for machine-learning champions☆27Updated 7 years ago
- Python wrapper for the Vowpal Wabbit machine learning library.☆53Updated 11 years ago
- Fast, easy and intuitive machine learning prototyping.☆124Updated 10 years ago
- Send summary messages of your Luigi jobs to Slack☆46Updated 5 years ago
- My capstone project for Galvanize (Zipfian Academy)☆38Updated 6 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 7 years ago
- Deploy Dask on Marathon☆10Updated 8 years ago
- Docker images for data science from Wise.io☆50Updated 8 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyPata Paris 2015☆50Updated 9 years ago
- Proposals for new Jupyter subprojects to enter into incubation☆18Updated 4 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 6 years ago
- My machine learning model for the See Click Predict Fix Kaggle competition☆31Updated 7 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.☆69Updated 10 years ago
- Creates models to classify documents into categories☆66Updated 7 years ago
- Common post-estimation tasks for scikit-learn☆17Updated 8 years ago
- Scripts to Analyze Pronto's Data Release☆24Updated 9 years ago
- An example project for doing grid search in MLlib☆13Updated 10 years ago
- An implementation of the multi-armed bandit optimization pattern as a Flask extension☆81Updated 2 weeks ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- PyData Seattle 2015: Python Data Bikeshed☆127Updated 9 years ago
- Material of the Kaggle Berlin meetup group!☆35Updated 7 years ago
- Articles on Data Science, Jupyter, and Pandas☆18Updated 9 years ago
- Using Word2Vec on lists and sets☆34Updated 9 years ago