MLWhiz / Spark_Projects
Spark Projects for the Berkeley Data Science Course
☆13 · Updated 9 years ago
Alternatives and similar repositories for Spark_Projects
Users who are interested in Spark_Projects are comparing it to the repositories listed below.
- Repository for Medium article ☆22 · Updated last year
- A collection of course materials, notes and assignments for the Masters of Information and Data Sciences program at UC Berkeley ☆13 · Updated 5 years ago
- Data Analysis and Visualization on Airbnb Data ☆11 · Updated 6 years ago
- ☆18 · Updated 7 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data ☆20 · Updated 7 years ago
- Contains source files used in the Spark with Python course ☆18 · Updated 6 years ago
- Code repo for Packt course I developed, "Beginning Data Wrangling with Python" ☆30 · Updated 5 years ago
- Quick EDA on a data set to determine what segments there are. ☆31 · Updated 6 years ago
- Create Interactive Dashboards with Streamlit and Python Coursera ☆10 · Updated 5 years ago
- Few tutorials on pandas, matplotlib and seaborn ☆27 · Updated 9 years ago
- Data Science and Machine Learning with Python - Hands On from Udemy ☆14 · Updated 8 years ago
- Contains code and presentation for my interactive hack session, 'Effective Feature Engineering: A Structured Approach to Building Better … ☆30 · Updated 4 years ago
- Data analysis using numpy, pandas, matplotlib, seaborn, sqlite3, data wrangling ☆32 · Updated 5 years ago
- Cancer Prediction based on Mass Spectrum Analysis ☆10 · Updated 2 years ago
- Hands on Unsupervised Learning with Python [Video], Published by Packt ☆29 · Updated 2 years ago
- Work for Mastering Large Datasets with Python ☆19 · Updated 2 years ago
- Basic TensorFlow mechanics, operations, class definitions, and neural networks building. Examples from deeplearning.ai Tensorflow course … ☆35 · Updated 6 years ago
- This is a guided certification project, as a part of Data Science for Social Good initiative ☆17 · Updated 5 years ago
- Apache Spark in 7 Days [Video], by Packt Publishing ☆18 · Updated 2 years ago
- This repository hosts the code/projects/demos/slides for Big Data technologies under Apache Hadoop and Apache Spark umbrella. ☆42 · Updated 2 years ago
- Exploratory Data Analysis with Pandas and Python 3.x, published by Packt ☆44 · Updated 2 years ago
- Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deploym… ☆62 · Updated 2 years ago
- ☆19 · Updated 4 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets. ☆13 · Updated 6 years ago
- Customer life time analysis (CLV analysis). We are using Gamma-Gamma model to estimate average transaction value for each customer. ☆47 · Updated 7 years ago
- Repository for GH public projects ☆18 · Updated last year
- ☆18 · Updated 5 years ago
- Source Code for 'Data Analysis and Visualization Using Python' by Dr. Ossama Embarak ☆51 · Updated 6 years ago
- Tips for Advanced Feature Engineering ☆52 · Updated 4 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on… ☆28 · Updated 3 years ago