Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API into a live dashboard using NiFi, Kafka, Hive LLAP with Druid integration and Superset. This workshop will also cover steps to remotely manage MiNiFi to send data to NiFi using Edge Flow Manager (EFM).
☆19Aug 16, 2019Updated 6 years ago
Alternatives and similar repositories for CDF-workshop
Users that are interested in CDF-workshop are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A complete custom processor project, for your reference.☆17Sep 29, 2015Updated 10 years ago
- HDF masterclass materials☆29Mar 28, 2016Updated 10 years ago
- HDFS Automatic Snapshot Service for Linux☆11Oct 17, 2016Updated 9 years ago
- HADOOP-CLI is an interactive command line shell that makes interacting with the Hadoop Distribted Filesystem (HDFS) simpler and more intu…☆38May 7, 2026Updated last month
- ☆32Feb 15, 2019Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The Device Manager Demo is designed to demonstrate a fully functioning modern Data/IoT application. It is a Lambda architecture built usi…☆13Aug 31, 2017Updated 8 years ago
- Manage Apache Atlas and Ranger configuration for your Hadoop environment.☆16May 4, 2021Updated 5 years ago
- Apache Nifi Hello World Example☆22Jan 26, 2018Updated 8 years ago
- Creating a REST API with Python on Synapse Serverless pools using external tables☆12Dec 27, 2021Updated 4 years ago
- Explore Ambari REST APIs from an Ambari view☆18Dec 7, 2015Updated 10 years ago
- Edge2AI Workshop☆70Jun 11, 2025Updated 11 months ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters☆83Apr 17, 2019Updated 7 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Mar 23, 2016Updated 10 years ago
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Memory / Configuration Calculator for Hive LLAP☆14Jul 18, 2020Updated 5 years ago
- Simulation of job offers and CVs with real-time processing, classification, and analytics using Kafka, Ray, Spark, and Databricks. Includ…☆14Dec 25, 2024Updated last year
- Ansible scripts for deploying Kafka on EC2☆10Oct 7, 2016Updated 9 years ago
- Microsoft 365 Defender Hunting via PowerShell.☆14Feb 8, 2022Updated 4 years ago
- Function to rotate storage account keys stored in key vault as secret☆13Nov 15, 2023Updated 2 years ago
- Integrate Grafana with Ambari Metrics System☆27Jun 13, 2025Updated 11 months ago
- a azure monitor workbook for LogicApps☆18Apr 10, 2020Updated 6 years ago
- This repository contains notebooks with different probability density function estimators.☆13Jun 4, 2020Updated 6 years ago
- ☆49Oct 22, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆12Mar 15, 2022Updated 4 years ago
- Ambari stack service for easily installing and managing Solr on HDP cluster☆37Jan 3, 2018Updated 8 years ago
- Spark stream from kafka(json) to s3(parquet)☆15Nov 8, 2018Updated 7 years ago
- Pulsar Presto (outdated), go to https://github.com/apache/pulsar-sql instead☆18Oct 18, 2024Updated last year
- Data Quality Monitoring Tool☆15Dec 5, 2017Updated 8 years ago
- spark自学手册,包含了例如spark core、spark sql、spark streaming、spark-kafka、delta-lake,以及scala基础练习,还有一些例如master、shuffle源码分析,总结及翻译。☆18Jul 19, 2023Updated 2 years ago
- Terraform module to deploy Cloudera on Oracle Cloud Infrastructure (OCI)☆19Oct 20, 2021Updated 4 years ago
- Spark-based pipeline to extract and parse monthly games from the Lichess database.☆21Sep 22, 2025Updated 8 months ago
- Kafka Connect connector for receiving data and writing data to Splunk.☆25Nov 7, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Telemetry and logs generator for benchmarks☆21Aug 23, 2022Updated 3 years ago
- File Watcher 核心库:轻量级Java库☆30Sep 20, 2018Updated 7 years ago
- Create a data mart using Azure Data Factory as ELT / ETL, Azure Synapse as database and Power BI as visualization tool.☆19Apr 20, 2022Updated 4 years ago
- This group will share the private preview documentation, issues for the partial update feature.☆19Oct 1, 2021Updated 4 years ago
- Template to deploy Synapse Analytics using best practices to deliver a proof of concept.☆21Mar 3, 2023Updated 3 years ago
- Asynchronous inorder messaging using MQTT Paho for ANDROID☆10Mar 21, 2015Updated 11 years ago
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆27Mar 25, 2024Updated 2 years ago