streamsets / tutorialsLinks
StreamSets Tutorials
☆350Updated last year
Alternatives and similar repositories for tutorials
Users that are interested in tutorials are comparing it to the libraries listed below
Sorting:
- Apache NiFi example flows☆205Updated 5 years ago
- A collection of templates for use with Apache NiFi.☆279Updated 8 years ago
- Cloudera Manager API Client☆306Updated last year
- A tool to install, configure and manage Presto installations☆170Updated 2 years ago
- Ambari stack service for easily installing and managing Hue on HDP cluster☆107Updated 5 years ago
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆1,113Updated 2 years ago
- Mirror of Apache Knox☆203Updated last week
- Mirror of Apache Sentry☆119Updated 5 years ago
- A Spark Atlas connector to track data lineage in Apache Atlas☆267Updated 2 years ago
- Kafka Connect to Hbase☆43Updated 4 years ago
- Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and…☆651Updated 2 weeks ago
- Ambari service for Apache Flink☆127Updated 4 years ago
- Docker image with Ambari☆291Updated 7 years ago
- This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container.☆142Updated last year
- A Maven-based example of using Cloudera Impala's JDBC driver☆118Updated 9 years ago
- Cloudera Manager Extensibility Tools and Documentation.☆189Updated last year
- Build configuration-driven ETL pipelines on Apache Spark☆160Updated 2 years ago
- Apache MiNiFi (a subproject of Apache NiFi)☆124Updated 4 years ago
- Ambari service for Presto☆44Updated 6 months ago
- Dockerfiles for StreamSets Data Collector☆114Updated 5 months ago
- Few scripts to automate daily data loads from RDBMS to Partitioned Avro Hive table☆30Updated 10 years ago
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.☆552Updated 4 years ago
- ☆240Updated 3 years ago
- Mirror of Apache Atlas (Incubating)☆94Updated 2 years ago
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.☆279Updated 6 years ago
- High Performance Kafka Connector for Spark Streaming.Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.…☆632Updated 3 years ago
- ☆198Updated last month
- Kettle plugin that provides support for interacting within many "big data" projects including Hadoop, Hive, HBase, Cassandra, MongoDB, an…☆238Updated this week
- Kerberos and Hadoop: The Madness beyond the Gate☆280Updated 2 years ago
- Demos around Ambari Views, Services, Blueprints☆63Updated 9 years ago