pranab / avenir
Set of Machine Learning and Stochastic Optimization tools based on Hadoop, Spark and Storm https://pkghosh.wordpress.com/
☆174 · Updated 10 months ago
Related projects
Alternatives and complementary repositories for avenir
- Recommender System Framework ☆124 · Updated 7 years ago
- ☆160 · Updated 7 years ago
- Pydata Dallas 2015 Scikit-Learn Tutorial ☆62 · Updated 9 years ago
- ☆77 · Updated 8 years ago
- Kaggle Criteo https://www.kaggle.com/c/criteo-display-ad-challenge ☆96 · Updated 10 years ago
- General Assembly repo for Data Science 18 ☆36 · Updated 9 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References ☆70 · Updated 5 years ago
- PyCon SG 2016 - Customer Segmentation in Python ☆56 · Updated 8 years ago
- Code supporting Data Science articles at The Marketing Technologist, Floryn Tech Blog, and Pythom.nl ☆70 · Updated last year
- Environment setup for Strata Conference 2018 ☆68 · Updated 6 years ago
- Additional files for the Otto Group Challenge hosted by Kaggle ☆36 · Updated 9 years ago
- This repository contains materials for demos, tutorials, and talks by Dato Inc. ☆173 · Updated 8 years ago
- Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark with spark-sklearn, on MapR. ☆44 · Updated 8 years ago
- My machine learning model for the See Click Predict Fix Kaggle competition ☆31 · Updated 7 years ago
- Word2Vec models with Twitter data using Spark. Blog: ☆65 · Updated 5 years ago
- This is where all of the IPython Notebooks will be kept from the blog ☆59 · Updated 6 years ago
- My winning solution for Kaggle Higgs Machine Learning Challenge (single classifier, xgboost) ☆81 · Updated 10 years ago
- Contains code from participation in Kaggle competitions. ☆37 · Updated 8 years ago
- Bosch Kaggle competition: Reduce manufacturing failures (https://www.kaggle.com/c/bosch-production-line-performance) ☆24 · Updated 8 years ago
- Tutorial repo for the article "ML in Production" ☆30 · Updated last year
- Feature Engineering with Pipeline Talk at ODSC West 2016, Santa Clara ☆17 · Updated 8 years ago
- This repository contains code files, specifically IPython notebooks, for the assignments in the course "Introduction to Big Data with Apach… ☆114 · Updated 3 months ago
- Code for the Kaggle acquire valued shoppers challenge ☆66 · Updated 10 years ago
- PySpark Notebook and Shiny App for Demo ☆35 · Updated 7 years ago
- IPython notebook for PyData SF 2014 tutorial: "Gradient Boosted Regression Trees in scikit-learn" ☆63 · Updated 7 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn. ☆69 · Updated 9 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S… ☆66 · Updated 8 years ago
- Builds a recommender system using TensorFlow ☆86 · Updated 7 years ago