charlesb / CDF-workshop
Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through streaming data from a REST API into a live dashboard using NiFi, Kafka, Hive LLAP with Druid integration, and Superset. The workshop also covers remotely managing MiNiFi agents with Edge Flow Manager (EFM) so they can send data to NiFi.
☆19 · Updated 6 years ago
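The ingestion step the workshop describes (poll a REST API, then publish each record to Kafka for downstream consumers) can be sketched in Python. The `fetch`/`send` callables, the sample payload, and the `sensor-readings` topic below are hypothetical stand-ins, not part of the workshop's code; the in-memory fakes let the sketch run without NiFi or a live Kafka broker, where in practice `send` would be a Kafka producer's `send()`.

```python
import json
from typing import Callable, List, Tuple

def poll_and_forward(fetch: Callable[[], str],
                     send: Callable[[str, bytes], None],
                     topic: str) -> int:
    """Fetch one batch of JSON records from a REST source and forward
    each record, serialized as UTF-8 JSON, to a messaging sink."""
    records = json.loads(fetch())
    for record in records:
        send(topic, json.dumps(record).encode("utf-8"))
    return len(records)

# In-memory stand-ins (hypothetical), so the sketch runs offline.
sent: List[Tuple[str, bytes]] = []
fake_fetch = lambda: json.dumps([{"id": 1, "temp": 21.5},
                                 {"id": 2, "temp": 19.8}])
fake_send = lambda topic, value: sent.append((topic, value))

count = poll_and_forward(fake_fetch, fake_send, "sensor-readings")
```

In the tutorial itself this poll-publish loop is handled visually by a NiFi flow (e.g. an HTTP-ingest processor feeding a Kafka-publish processor) rather than hand-written code.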
Alternatives and similar repositories for CDF-workshop
Users interested in CDF-workshop are comparing it to the repositories listed below.
- ☆27 · Updated last year
- Spark and Delta Lake Workshop ☆22 · Updated 3 years ago
- Apache Spark Connector for SQL Server and Azure SQL ☆287 · Updated 10 months ago
- An Azure Function which allows Azure Data Factory (ADF) to connect to Snowflake in a flexible way. ☆26 · Updated 2 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, … ☆78 · Updated this week
- TPCDS benchmark for various engines ☆18 · Updated 3 years ago
- DataQuality for BigData ☆145 · Updated 2 years ago
- Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs ☆238 · Updated 10 months ago
- Delta Lake Documentation ☆51 · Updated last year
- dbt adapter for Azure Synapse Dedicated SQL Pools ☆76 · Updated 4 months ago
- A simple Spark-powered ETL framework that just works 🍺 ☆181 · Updated 3 months ago
- Edge2AI Workshop ☆70 · Updated 6 months ago
- Examples for High Performance Spark ☆16 · Updated 2 months ago
- An Azure Databricks workshop leveraging the New York Taxi and Limousine Commission Trip Records dataset ☆110 · Updated 2 years ago
- Example code for doing DataOps ☆49 · Updated 4 years ago
- Databricks Platform - Architecture, Security, Automation and much more! ☆51 · Updated this week
- Delta Lake examples ☆235 · Updated last year
- Demo of using Nutter for testing Databricks notebooks in a CI/CD pipeline ☆152 · Updated last year
- Testing framework for Databricks notebooks ☆312 · Updated last year
- Data validation library for PySpark 3.0.0 ☆33 · Updated 3 years ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines ☆123 · Updated 3 weeks ago
- Multi-stage, config-driven, SQL-based ETL framework using PySpark ☆26 · Updated 6 years ago
- Client library for Azure Databricks ☆84 · Updated 3 weeks ago
- dbt adapter for dbt serverless pools ☆13 · Updated 2 years ago
- The Lakehouse Engine is a configuration-driven Spark framework, written in Python, serving as a scalable and distributed engine for sever… ☆279 · Updated 2 months ago
- This project provides a client library that allows Azure SQL DB or SQL Server to act as an input source or output sink for Spark jobs. ☆76 · Updated 5 years ago
- ☆32 · Updated 6 years ago
- A simplified, lightweight ETL framework based on Apache Spark ☆586 · Updated last year
- ☆76 · Updated last year
- The Taxonomy for ETL Automation Metadata (TEAM) is a tool for design metadata management geared towards data warehouse automation. It is … ☆37 · Updated 10 months ago