Using Hortonworks HDP 3.1.0 and HDF 3.4.0 components, this tutorial walks through streaming data from a REST API into a live dashboard with NiFi, Kafka, Hive LLAP with Druid integration, and Superset. The workshop also covers using Edge Flow Manager (EFM) to remotely manage MiNiFi agents that send data to NiFi.
☆19Aug 16, 2019Updated 6 years ago
Alternatives and similar repositories for CDF-workshop
Users who are interested in CDF-workshop are comparing it to the libraries listed below.
- Materials for various Hadoop & Nifi related workshops☆51Mar 20, 2019Updated 7 years ago
- A complete custom processor project, for your reference.☆17Sep 29, 2015Updated 10 years ago
- HDFS Automatic Snapshot Service for Linux☆11Oct 17, 2016Updated 9 years ago
- ☆32Feb 15, 2019Updated 7 years ago
- The Device Manager Demo is designed to demonstrate a fully functioning modern Data/IoT application. It is a Lambda architecture built usi…☆13Aug 31, 2017Updated 8 years ago
- Ambari service to deploy/manage Hortonworks IoT demo☆22Apr 27, 2017Updated 9 years ago
- A simple Panel-based dashboard visualizing geotagged tweets with hvplot and Datashader.☆17Mar 25, 2024Updated 2 years ago
- Creating a REST API with Python on Synapse Serverless pools using external tables☆12Dec 27, 2021Updated 4 years ago
- Explore Ambari REST APIs from an Ambari view☆18Dec 7, 2015Updated 10 years ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters☆83Apr 17, 2019Updated 7 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Mar 23, 2016Updated 10 years ago
- Simulation of job offers and CVs with real-time processing, classification, and analytics using Kafka, Ray, Spark, and Databricks. Includ…☆14Dec 25, 2024Updated last year
- Ansible scripts for deploying Kafka on EC2☆10Oct 7, 2016Updated 9 years ago
- Integrate Grafana with Ambari Metrics System☆27Jun 13, 2025Updated 10 months ago
- My HackerRank Solutions : https://www.hackerrank.com/RohanKhude☆12Jul 13, 2016Updated 9 years ago
- ☆20Apr 27, 2012Updated 14 years ago
- An Azure Monitor workbook for Logic Apps☆18Apr 10, 2020Updated 6 years ago
- ☆49Oct 22, 2024Updated last year
- ☆12Mar 15, 2022Updated 4 years ago
- NICTA Named Entity Recogniser is a rule based Named Entity Recogniser which extracts named entities from text such as Organisation, Locat…☆16Apr 15, 2023Updated 3 years ago
- Spark DataFrame transformation and UDF test examples☆22Feb 13, 2023Updated 3 years ago
- ☆15Jan 17, 2022Updated 4 years ago
- Ambari stack service for easily installing and managing Solr on HDP cluster☆37Jan 3, 2018Updated 8 years ago
- Spark stream from kafka(json) to s3(parquet)☆15Nov 8, 2018Updated 7 years ago
- Docker Compose files for composing a NiFi cluster on Docker.☆35May 29, 2017Updated 8 years ago
- Algorithms and Data Structures implemented in Java☆12Jul 28, 2019Updated 6 years ago
- Pulsar Presto (outdated), go to https://github.com/apache/pulsar-sql instead☆18Oct 18, 2024Updated last year
- Data Quality Monitoring Tool☆15Dec 5, 2017Updated 8 years ago
- Blazing fast, modular, next gen logagent☆11Apr 16, 2026Updated last week
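- A Spark self-study handbook covering spark core, spark sql, spark streaming, spark-kafka, and delta-lake, along with basic Scala exercises and source-code analyses, summaries, and translations for topics such as master and shuffle.☆18Jul 19, 2023Updated 2 years ago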
- Code repository for Learning Apache Spark 2, published by Packt☆21Jan 30, 2023Updated 3 years ago
- Spark-based pipeline to extract and parse monthly games from the Lichess database.☆21Sep 22, 2025Updated 7 months ago
- Open Network Inspection Command Suite☆10Dec 4, 2022Updated 3 years ago
- TRANSPARÊNCIA COVID-19☆19Dec 17, 2021Updated 4 years ago
- Kafka Connect connector for receiving data and writing data to Splunk.☆25Nov 7, 2017Updated 8 years ago
- Telemetry and logs generator for benchmarks☆21Aug 23, 2022Updated 3 years ago
- Create a data mart using Azure Data Factory as ELT / ETL, Azure Synapse as database and Power BI as visualization tool.☆20Apr 20, 2022Updated 4 years ago
- Template to deploy Synapse Analytics using best practices to deliver a proof of concept.☆21Mar 3, 2023Updated 3 years ago
- Examples for Apache Oozie book☆18May 30, 2016Updated 9 years ago