charlesb / CDF-workshop
Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through the steps to stream data from a REST API into a live dashboard using NiFi, Kafka, Hive LLAP with Druid integration, and Superset. The workshop also covers the steps to remotely manage MiNiFi with Edge Flow Manager (EFM) so that it sends data to NiFi.
☆19 Updated 6 years ago
Alternatives and similar repositories for CDF-workshop
Users who are interested in CDF-workshop are comparing it to the repositories listed below.
- ☆27 Updated last year
- Edge2AI Workshop ☆70 Updated 3 months ago
- A simple Spark-powered ETL framework that just works 🍺 ☆182 Updated last month
- Delta Lake Documentation ☆49 Updated last year
- Spark and Delta Lake Workshop ☆22 Updated 3 years ago
- Apache Spark Connector for SQL Server and Azure SQL ☆287 Updated 6 months ago
- dbt adapter for Azure Synapse Dedicated SQL Pools ☆75 Updated 3 weeks ago
- Delta Lake examples ☆227 Updated 11 months ago
- DataQuality for BigData ☆144 Updated last year
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, .... ☆76 Updated last week
- Databricks Platform - Architecture, Security, Automation and much more!! ☆51 Updated last month
- Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs ☆238 Updated 7 months ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines ☆124 Updated this week
- TPCDS benchmark for various engines ☆18 Updated 3 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark ☆16 Updated last year
- The Virtual Data Warehouse is a code generation and template management tool. It is part of the data solution automation ecosystem - the … ☆45 Updated 2 months ago
- A simplified, lightweight ETL Framework based on Apache Spark ☆589 Updated last year
- MonitoFi: Health & Performance Monitor for your Apache NiFi ☆66 Updated 2 years ago
- An Azure Databricks workshop leveraging the New York Taxi and Limousine Commission Trip Records dataset ☆110 Updated 2 years ago
- SQL Queries & Alerts for Databricks System Tables access.audit Logs ☆33 Updated last month
- The Internals of Spark on Kubernetes ☆71 Updated 3 years ago
- A general purpose framework for automating Cloudera Products ☆67 Updated 6 months ago
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse. ☆46 Updated 7 months ago
- ☆22 Updated 2 years ago
- Guide for databricks spark certification ☆58 Updated 4 years ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ… ☆165 Updated last week
- ☆32 Updated 6 years ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever… ☆266 Updated last week
- Examples of metadata driven SQL processes implemented in Databricks ☆16 Updated 4 years ago
- Connect your Spark Databricks clusters Log4J output to the Application Insights Appender ☆20 Updated 5 years ago