charlesb / CDF-workshopLinks
Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API into a live dashboard using NiFi, Kafka, Hive LLAP with Druid integration and Superset. This workshop will also cover steps to remotely manage MiNiFi to send data to NiFi using Edge Flow Manager (EFM).
☆19Updated 6 years ago
Alternatives and similar repositories for CDF-workshop
Users that are interested in CDF-workshop are comparing it to the libraries listed below
Sorting:
- Edge2AI Workshop☆70Updated 4 months ago
- Spark and Delta Lake Workshop☆22Updated 3 years ago
- TPCDS benchmark for various engines☆18Updated 3 years ago
- ☆27Updated last year
- Delta Lake examples☆230Updated last year
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆77Updated this week
- A simplified, lightweight ETL Framework based on Apache Spark☆588Updated last year
- Apache Spark Connector for SQL Server and Azure SQL☆286Updated 8 months ago
- A simple Spark-powered ETL framework that just works 🍺☆182Updated last month
- The Internals of Spark on Kubernetes☆72Updated 3 years ago
- DataQuality for BigData☆144Updated last year
- Multiple node presto cluster on docker container☆126Updated 3 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated last year
- DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics control framework that can be used to monitor, log, aud…☆27Updated 2 months ago
- The Virtual Data Warehouse is a code generation and template management tool. It is part of the data solution automation ecosystem - the …☆45Updated 3 months ago
- Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs☆238Updated 8 months ago
- An Azure Databricks workshop leveraging the New York Taxi and Limousine Commission Trip Records dataset☆109Updated 2 years ago
- dbt adapter for Azure Synapse Dedicated SQL Pools☆75Updated 2 months ago
- Multi-stage, config driven, SQL based ETL framework using PySpark☆26Updated 6 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 6 years ago
- Code snippets used in demos recorded for the blog.☆37Updated 2 months ago
- Spark app to merge different schemas☆23Updated 4 years ago
- ☆22Updated 2 years ago
- Delta Lake Documentation☆50Updated last year
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆122Updated this week
- Databricks Platform - Architecture, Security, Automation and much more!!☆51Updated this week
- Kafka sink for Kusto☆51Updated 3 weeks ago
- An Azure Function which allows Azure Data Factory (ADF) to connect to Snowflake in a flexible way.☆26Updated 2 years ago
- A Spark datasource for the HadoopOffice library☆37Updated last month
- SQL Queries & Alerts for Databricks System Tables access.audit Logs☆37Updated 3 months ago