hanhanwu / Hanhan_Data_Science_Resources2
more data science resources
☆14Updated 3 years ago
Alternatives and similar repositories for Hanhan_Data_Science_Resources2
Users who are interested in Hanhan_Data_Science_Resources2 are comparing it to the repositories listed below
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆26Updated 8 years ago
- feng - feature engineering for machine-learning champions☆27Updated 8 years ago
- Tweet Analysis with Spark☆15Updated 8 years ago
- I developed this case study in only 7 days with PySpark (Spark 1.6.0) SQL & MLlib. I used a Databricks cluster and AWS. 90% AUC is achieved…☆17Updated 9 years ago
- Predicting sales with Pandas☆15Updated 9 years ago
- Tutorial on deploying machine learning models to production☆59Updated 5 years ago
- 💫 Runtime performance comparison of spaCy against other NLP libraries☆20Updated 3 years ago
- Predicting happiness from demographics and poll answers☆45Updated 8 years ago
- Document clustering in Python☆30Updated 9 years ago
- A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries.☆65Updated 8 years ago
- PySpark Machine Learning Examples☆45Updated 7 years ago
- Show how to perform fast retraining with LightGBM in different business cases☆54Updated 6 years ago
- Some work on Kaggle data for fun☆64Updated 7 years ago
- Datasets and notebooks☆13Updated 8 years ago
- Install directions and example notebooks for Udacity's Deep Learning classes☆28Updated 9 years ago
- VW, Liblinear and StreamSVM compared on webspam☆14Updated 10 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 9 years ago
- Quick-Data-Science-Experiments☆19Updated 7 years ago
- A helper library for data science pipeline☆36Updated 6 years ago
- GBM multicore scaling: h2o, xgboost and lightgbm on multicore and multi-socket systems☆20Updated 7 years ago
- Analysis of Categorical Encodings for dense Decision Trees☆41Updated 8 years ago
- Kaggle competition results☆20Updated 6 years ago
- Machine Learning Orchestration☆51Updated 5 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 9 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 9 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆66Updated 6 years ago
- Different machine learning algorithms implementation in Tensorflow☆27Updated 8 years ago
- Presentations & notebooks from our talks /workshops/meetups/etc☆24Updated 7 years ago
- Code to 1) scrape Wikipedia page view counts and 2) conduct time series analysis with GAM☆47Updated 8 years ago