hanhanwu / Hanhan_Data_Science_Resources2Links
more data science resources
☆14Updated 3 years ago
Alternatives and similar repositories for Hanhan_Data_Science_Resources2
Users that are interested in Hanhan_Data_Science_Resources2 are comparing it to the libraries listed below
Sorting:
- Show how to perform fast retraining with LightGBM in different business cases☆54Updated 6 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 10 years ago
- PySpark Machine Learning Examples☆45Updated 7 years ago
- ☆13Updated 6 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 9 years ago
- I developed this case study only in 7 days with Pyspark (Spark 1.6.0) SQL & MLlib. I used Databricks cluster and AWS. %90 AUC is achieved…☆17Updated 9 years ago
- 57th place solution in "Bosch Production Line Performance"☆19Updated 8 years ago
- Some work on Kaggle data for fun☆64Updated 8 years ago
- Predicting happiness from demographics and poll answers☆46Updated 9 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆32Updated 10 years ago
- ☆15Updated 8 years ago
- Kaggle Criteo https://www.kaggle.com/c/criteo-display-ad-challenge☆97Updated 11 years ago
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆27Updated 9 years ago
- Awesome Distributed Machine Learning Frameworks☆31Updated 8 years ago
- Datasets and notebooks☆13Updated 9 years ago
- feng - feature engineering for machine-learning champions☆27Updated 8 years ago
- Comparing keras, pytorch and gluon using neural collaborative filtering☆18Updated 6 years ago
- VW, Liblinear and StreamSVM compared on webspam☆14Updated 11 years ago
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017☆12Updated 8 years ago
- Solution to Kaggle's Mercari Price Suggestion Competition☆22Updated 7 years ago
- Brian Farris' Talk on Reinforcement Learning and Multi-Armed Bandits for the Data Incubator☆30Updated 7 years ago
- R Code + R Notebook for analyzing millions of Amazon reviews using Apache Spark☆85Updated 9 years ago
- Hadoop, Spark and Storm based anomaly detection implementations for data quality, cyber security, fraud detection etc.☆129Updated 2 years ago
- Kaggle competition results☆20Updated 7 years ago
- ☆35Updated 9 years ago
- Set of Machine Learning and Stochastic Optimazion tools based on Hadoop, Spark and Storm https://pkghosh.wordpress.com/☆177Updated 2 years ago
- Fraud Detection using ensemble of Statistical, Network analysis and Machine learning approach.☆68Updated 11 years ago
- Some thoughts on how to use machine learning in production☆71Updated 8 years ago
- Fast, accurate, lightweight, multi-core ML in Python, leveraging Vowpal Wabbit☆21Updated 7 years ago
- Python implementation of machine learning metrics☆14Updated 8 years ago