dgadiraju / nifi-workshop
☆25Updated 4 years ago
Alternatives and similar repositories for nifi-workshop:
Users who are interested in nifi-workshop are comparing it to the repositories listed below
- ☆14Updated 6 years ago
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Updated 4 years ago
- My Study guide used to pass the CRT020 Spark Certification exam☆33Updated 5 years ago
- ☆115Updated 4 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆54Updated 2 years ago
- ETL pipeline using pyspark (Spark - Python)☆114Updated 5 years ago
- Guide for databricks spark certification☆58Updated 3 years ago
- All the Snowflake Virtual Warehouse - Example☆12Updated 4 years ago
- Educational notes, hands-on problems with solutions for the Hadoop ecosystem☆87Updated 6 years ago
- Data Engineering with Spark and Delta Lake☆98Updated 2 years ago
- Code Repository for GCP: Complete Google Data Engineer and Cloud Architect Guide(v), Published by Packt☆16Updated 2 years ago
- ☆25Updated last year
- Repository of notebooks and related collateral used in the Databricks Demo Hub, showing how to use Databricks, Delta Lake, MLflow, and mo…☆25Updated 3 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated last year
- End-to-end Machine Learning Pipeline demo using Delta Lake, MLflow and AzureML in Azure Databricks☆18Updated 5 years ago
- CICD pipeline that deploys a dbt image on a GKE cluster☆11Updated 3 years ago
- Repository for all ITVersity Vagrant Boxes.☆31Updated 5 years ago
- Optimizing Databricks Workload, published by Packt☆17Updated 2 years ago
- ☆87Updated 2 years ago
- Road to Azure Data Engineer Part-I: DP-200 - Implementing an Azure Data Solution☆67Updated 4 years ago
- ☆35Updated 2 months ago
- Simplifying Data Engineering and Analytics with Delta, published by Packt☆21Updated last year
- Multi-stage, config driven, SQL based ETL framework using PySpark☆25Updated 5 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- Data Engineering on GCP☆35Updated 2 years ago
- My Git Repo for Csv Data☆21Updated 4 years ago
- Apache Spark 3 - Structured Streaming Course Material☆122Updated last year
- Azure Databricks Cookbook, Published by Packt☆59Updated last year
- Batch processing, orchestration using Apache Airflow and Google Workflows, Spark Structured Streaming and a lot more☆19Updated 2 years ago
- Apache Spark Structured Streaming with Kafka using Python (PySpark)☆40Updated 5 years ago