streamsets / datacollector-dockerLinks
Dockerfiles for StreamSets Data Collector
☆114Updated 7 months ago
Alternatives and similar repositories for datacollector-docker
Users that are interested in datacollector-docker are comparing it to the libraries listed below
Sorting:
- Apache MiNiFi (a subproject of Apache NiFi)☆125Updated 4 years ago
- A visual ETL development and debugging tool for big data☆154Updated 2 years ago
- Collection of examples integrating NiFi with stream process frameworks.☆59Updated 9 years ago
- StreamSets Tutorials☆351Updated last year
- Presto Plugin for Oracle JDBC Connection☆43Updated 2 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 6 years ago
- StreamLine - Streaming Analytics☆165Updated 2 years ago
- Apache NiFi example flows☆207Updated 5 years ago
- A proof of concept using Divolte, Kafka, Druid and Superset☆61Updated 5 years ago
- CDP Public Cloud is an integrated analytics and data management platform deployed on cloud services. It offers broad data analytics and a…☆361Updated this week
- A plugin to the Kafka Connect framework that replicates data from MySQL to Kafka☆96Updated 9 years ago
- Big Data ETL and Utilities for Hadoop Map Reduce, Spark and Storm☆103Updated last year
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆22Updated 6 years ago
- Demos around Ambari Views, Services, Blueprints☆63Updated 9 years ago
- This repository is to help with the Partner Demonstration of the Apache Atlas project.☆30Updated 9 years ago
- An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes☆84Updated 5 years ago
- SQL for Kafka Connectors☆99Updated last year
- This application, Kafka ES Indexer, will read the messages from Kafka, processes (if needed) and batch index them into ElasticSearch.☆162Updated 4 years ago
- Examples for how to use the Flink Docker images in a variety of ways☆91Updated 3 years ago
- 🔍 Open Distro for Elasticsearch JDBC Driver☆113Updated 5 years ago
- Few scripts to automate daily data loads from RDBMS to Partitioned Avro Hive table☆30Updated 11 years ago
- Mirror of Apache Knox☆206Updated this week
- A tool to install, configure and manage Presto installations☆171Updated 2 years ago
- Ansible playbooks to construct distributed computing environments☆62Updated 4 years ago
- Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http:…☆72Updated 2 years ago
- Dockerized HDP Cluster☆84Updated 7 years ago
- ☆26Updated 6 years ago
- Demo Ambari service to deploy/manage NiFi on HDP - Deprecated☆75Updated 7 years ago
- Docker Image for Kudu☆38Updated 6 years ago
- Real-time analytics in Apache Flume☆52Updated 9 years ago