cloudera / CML_AMP_Churn_Prediction
Build an scikit-learn model to predict churn using customer telco data.
☆15Updated 3 months ago
Alternatives and similar repositories for CML_AMP_Churn_Prediction:
Users that are interested in CML_AMP_Churn_Prediction are comparing it to the libraries listed below
- Essential PySpark for Scalable Data Analytics, published by Packt☆43Updated 2 years ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 7 months ago
- Example Python and R code for Cloudera Machine Learning (CML) training☆14Updated 4 years ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆44Updated 5 years ago
- Full stack data engineering tools and infrastructure set-up☆50Updated 4 years ago
- PySpark-ETL☆23Updated 5 years ago
- ☆33Updated last year
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆100Updated 4 years ago
- ☆40Updated 8 months ago
- Data Engineering with Scala, published by Packt☆23Updated last year
- ☆87Updated 2 years ago
- Databricks ML in Action, Published by Packt☆27Updated 10 months ago
- A Postgres data warehouse for processing synthetic data using IAC principles☆16Updated 2 years ago
- PySpark Tutorial for Beginners - Practical Examples in Jupyter Notebook with Spark version 3.4.1. The tutorial covers various topics like…☆101Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆72Updated 9 months ago
- ☆16Updated last year
- Challenge Data Engineer☆25Updated 2 years ago
- Automated Machine Learning on AWS, published by Packt☆45Updated last year
- Finance 🏦 Data Builder 🛠️ @ postgres 🐘☆20Updated 4 years ago
- Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3☆30Updated 4 years ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆20Updated 3 years ago
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 4 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆30Updated 11 months ago
- ☆14Updated last year
- A course by DataTalks Club that covers Spark, Kafka, Docker, Airflow, Terraform, DBT, Big Query etc☆14Updated 3 years ago
- Course Material Data Engineering on AWS Course☆28Updated 6 months ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆40Updated last year
- Classwork projects and home works done through Udacity data engineering nano degree☆74Updated last year
- Step by step instructions to create a production-ready data pipeline☆42Updated 3 months ago