frrmack / hadoop-python-hive-tutorial
A tutorial for using Hadoop with Python and Hive
☆10Updated 9 years ago
Related projects: ⓘ
- Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark with spark-sklearn, on MapR.☆44Updated 8 years ago
- A simple introduction to using spark ml pipelines☆26Updated 6 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle☆33Updated 8 years ago
- ☆25Updated 6 years ago
- Introduction to Data Science☆18Updated 8 years ago
- Sharing interesting and noteworthy Data Engineering content☆65Updated 7 years ago
- Material for UW Extension Data Science 350☆19Updated 6 years ago
- ☆26Updated 8 months ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Updated 8 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆30Updated 9 years ago
- Udacity Data Pipeline Exercises☆15Updated 4 years ago
- A short tutorial notebook on PySpark☆15Updated 8 years ago
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc☆52Updated 8 years ago
- ☆39Updated this week
- Tutorial repo for the article "ML in Production"☆30Updated last year
- Data Science and Machine Learning with Python - Hands On from Udemy☆14Updated 7 years ago
- ☆16Updated 4 years ago
- Free resources for learning data science☆22Updated 6 years ago
- AWS, Vagrant, and Spark☆20Updated 8 years ago
- A tutorial to create python based prediction web app☆29Updated 4 years ago
- ☆16Updated 6 years ago
- ☆136Updated this week
- Serving TensorFlow Models using Kubernetes and TF Serving☆12Updated 6 years ago
- Code for the Spark tutorial at the Pydata conference in London June 2015☆12Updated 7 years ago
- My machine learning model for the See Click Predict Fix Kaggle competition☆31Updated 7 years ago
- All Kaggle competitions☆91Updated 8 years ago
- Twitter visualizaton experiment using various python-based technologies.☆60Updated 8 years ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python☆81Updated 5 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆66Updated 8 years ago
- A couple projects using scikit-learn illustrating project decision making.☆15Updated 7 years ago