hanhanwu / Hanhan_Data_Science_Resources2Links
more data science resources
☆14Updated 3 years ago
Alternatives and similar repositories for Hanhan_Data_Science_Resources2
Users that are interested in Hanhan_Data_Science_Resources2 are comparing it to the libraries listed below
Sorting:
- H2OAI Driverless AI Code Samples and Tutorials☆37Updated 11 months ago
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆26Updated 8 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 9 years ago
- VW, Liblinear and StreamSVM compared on webspam☆14Updated 11 years ago
- feng - feature engineering for machine-learning champions☆27Updated 8 years ago
- Predicting happiness from demographics and poll answers☆45Updated 8 years ago
- Code to create benchmarks for Kaggle's Facebook Recruiting Competition☆86Updated 13 years ago
- Show how to perform fast retraining with LightGBM in different business cases☆54Updated 6 years ago
- R Code + R Notebook for analyzing millions of Amazon reviews using Apache Spark☆85Updated 8 years ago
- ☆11Updated 8 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 9 years ago
- Some work on Kaggle data for fun☆64Updated 8 years ago
- Quick-Data-Science-Experiments☆19Updated 7 years ago
- A helper library for data science pipeline☆36Updated 6 years ago
- ☆15Updated 8 years ago
- Python utilities for Machine Learning competitions☆32Updated 7 years ago
- PySpark Machine Learning Examples☆45Updated 7 years ago
- Set of Machine Learning and Stochastic Optimazion tools based on Hadoop, Spark and Storm https://pkghosh.wordpress.com/☆177Updated last year
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017☆12Updated 8 years ago
- I developed this case study only in 7 days with Pyspark (Spark 1.6.0) SQL & MLlib. I used Databricks cluster and AWS. %90 AUC is achieved…☆17Updated 9 years ago
- Anomaly Detection model uses Spark for training and Spark Streaming for testing☆67Updated 9 years ago
- DEPRECATED Build, manage and deploy H2O's high-speed machine learning models.☆61Updated 6 years ago
- Analysis of Categorical Encodings for dense Decision Trees☆41Updated 8 years ago
- Keras Deep Learning neural network model for University of Wisconsin Cancer data that uses the Integrated Variants library to explain pre…☆74Updated 5 years ago
- Predicting sales with Pandas☆15Updated 9 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆31Updated 10 years ago
- Collection of presentation of my work on various platforms and meetups☆22Updated 6 years ago
- An API for Distributed Machine Learning☆155Updated 9 years ago
- A collection of data science examples implemented across a variety of languages and libraries.☆33Updated 9 years ago
- Install directions and example notebooks for Udacity's Deep Learning classes☆28Updated 9 years ago