dgadiraju / nifi-workshop
☆26Updated 4 years ago
Alternatives and similar repositories for nifi-workshop
Users who are interested in nifi-workshop are comparing it to the repositories listed below
- ☆14Updated 6 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated last year
- ☆115Updated 4 years ago
- Guide for databricks spark certification☆58Updated 4 years ago
- ☆26Updated last year
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆55Updated 2 years ago
- ETL pipeline using pyspark (Spark - Python)☆117Updated 5 years ago
- Data Engineering with Spark and Delta Lake☆101Updated 2 years ago
- Educational notes, hands-on problems w/ solutions for the Hadoop ecosystem☆87Updated 6 years ago
- Repository for all ITVersity Vagrant Boxes.☆31Updated 5 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Updated 4 years ago
- My Git Repo for Csv Data☆21Updated 4 years ago
- My Study guide used to pass the CRT020 Spark Certification exam☆33Updated 5 years ago
- Unit testing using databricks connect☆31Updated 3 years ago
- Apache Spark Course Material☆91Updated 2 years ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 6 years ago
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…☆24Updated 3 years ago
- ☆87Updated 2 years ago
- Batch processing, orchestration using Apache Airflow and Google Workflows, Spark Structured Streaming and a lot more☆18Updated 3 years ago
- ☆20Updated 5 years ago
- ☆19Updated 5 years ago
- ☆53Updated 4 years ago
- Collection of Sample Databricks Spark Notebooks (mostly for Azure Databricks)☆89Updated 7 years ago
- Spark implementation of Slowly Changing Dimension type 2☆11Updated 6 years ago
- PySpark-ETL☆23Updated 5 years ago
- Simplifying Data Engineering and Analytics with Delta, published by Packt☆21Updated last year
- Multi-stage, config driven, SQL based ETL framework using PySpark☆25Updated 5 years ago
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.☆45Updated 4 months ago
- All the Snowflake Virtual Warehouse - Example☆12Updated 5 years ago