big-data-europe / app-bde-pipeline
Bootstrap a pipeline on the BDE platform
☆26Updated 8 years ago
Alternatives and similar repositories for app-bde-pipeline:
Users that are interested in app-bde-pipeline are comparing it to the libraries listed below
- Big Data ETL and Utilities for Hadoop Map Reduce, Spark and Storm☆103Updated last year
- An example Apache Beam project.☆111Updated 7 years ago
- Real-world Spark pipelines examples☆83Updated 6 years ago
- spark-drools tutorials☆16Updated 10 months ago
- Spark on Kubernetes using Helm☆34Updated 4 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆159Updated 2 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆97Updated 2 years ago
- Apache Spark examples exclusively in Java☆100Updated last year
- ☆27Updated last month
- These are some code examples☆55Updated 5 years ago
- Pipeline library for StreamSets Data Collector and Transformer☆33Updated 2 years ago
- Interactive Notebooks that support the book☆39Updated 4 years ago
- Deploy your Spark Production Cluster on Kubernetes☆47Updated 4 years ago
- Postgresql configured to work as metastore for Hive.☆32Updated 2 years ago
- Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, …☆34Updated 2 months ago
- ☆105Updated 5 years ago
- Ansible playbooks to construct distributed computing environments☆62Updated 3 years ago
- Reference architecture for real-time stream processing with Apache Flink on Amazon EMR, Amazon Kinesis, and Amazon Elasticsearch Service.☆71Updated last year
- spark on kubernetes☆105Updated 2 years ago
- Ambari stack service for installing and managing Apache Airflow on HDP cluster☆59Updated 6 years ago
- Rocksdb state storage implementation for Structured Streaming.☆17Updated 4 years ago
- Apache Nifi Examples by http://www.nifi.rocks☆37Updated 6 years ago
- Docker Image and Kubernetes Configurations for Spark 2.x☆41Updated 5 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 5 years ago
- Dockerized Hadoop/Minio/Hive/Presto stack☆36Updated 11 months ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆62Updated 5 years ago
- One click deploy docker-compose with Kafka, Spark Streaming, Zeppelin UI and Monitoring (Grafana + Kafka Manager)☆119Updated 3 years ago
- Sample processing code using Spark 2.1+ and Scala☆51Updated 4 years ago
- Spark 3.0.0 Structured Streaming Kafka Avro Demo☆15Updated last year