streamsets / tutorialsLinks
StreamSets Tutorials
☆349Updated 10 months ago
Alternatives and similar repositories for tutorials
Users that are interested in tutorials are comparing it to the libraries listed below
Sorting:
- Apache NiFi example flows☆204Updated 5 years ago
- A collection of templates for use with Apache NiFi.☆277Updated 8 years ago
- Apache MiNiFi (a subproject of Apache NiFi)☆125Updated 4 years ago
- Docker image with Ambari☆291Updated 7 years ago
- hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE☆292Updated 2 years ago
- Cloudera Manager API Client☆306Updated last year
- A Spark Atlas connector to track data lineage in Apache Atlas☆266Updated 2 years ago
- ☆240Updated 3 years ago
- Ambari service for Apache Flink☆126Updated 4 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆282Updated 6 years ago
- High Performance Kafka Connector for Spark Streaming.Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.…☆632Updated 3 years ago
- A Maven-based example of using Cloudera Impala's JDBC driver☆118Updated 9 years ago
- Mirror of Apache Atlas (Incubating)☆94Updated 2 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆159Updated 2 years ago
- Cloudera Manager Extensibility Tools and Documentation.☆189Updated last year
- Ambari stack service for easily installing and managing Hue on HDP cluster☆107Updated 5 years ago
- Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and…☆642Updated last week
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.☆913Updated 2 weeks ago
- ☆199Updated 2 weeks ago
- Kettle plugin that provides support for interacting within many "big data" projects including Hadoop, Hive, HBase, Cassandra, MongoDB, an…☆238Updated this week
- Mirror of Apache Oozie☆722Updated 4 months ago
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆283Updated this week
- StreamLine - Streaming Analytics☆164Updated last year
- The Internals of Spark Structured Streaming☆420Updated 2 years ago
- Mirror of Apache Sentry☆119Updated 4 years ago
- Mirror of Apache Knox☆198Updated this week
- This repository trackes the code and files for building docker image with Apache Kylin.☆126Updated 3 years ago
- Dockerfiles for StreamSets Data Collector☆114Updated 3 months ago
- A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).☆152Updated 2 years ago
- ☆103Updated 5 years ago