streamsets / pipeline-library
Pipeline library for StreamSets Data Collector and Transformer
☆32Updated last year
Related projects ⓘ
Alternatives and complementary repositories for pipeline-library
- A Flink applcation that demonstrates reading and writing to/from Apache Kafka with Apache Flink☆20Updated last year
- ☆27Updated 2 weeks ago
- spark-drools tutorials☆16Updated 6 months ago
- Scalable CDC Pattern Implemented using PySpark☆18Updated 5 years ago
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Updated 3 years ago
- ☆47Updated 4 years ago
- ☆13Updated last week
- ☆24Updated 2 months ago
- Cask Hydrator Plugins Repository☆66Updated 2 weeks ago
- Postgresql configured to work as metastore for Hive.☆30Updated last year
- Collection of examples integrating NiFi with stream process frameworks.☆56Updated 8 years ago
- ☆26Updated 9 months ago
- NiFi Processor for Apache Pulsar☆10Updated 7 months ago
- Hadoop/Hive/Spark container to perform CI tests☆11Updated 3 years ago
- A bridge to Apache Atlas for provenance metadata created in course of using Apache NiFi☆15Updated last year
- ☆39Updated 5 years ago
- MonitoFi: Health & Performance Monitor for your Apache NiFi☆61Updated last year
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆28Updated last week
- Apache StreamPipes - A self-service (Industrial) IoT toolbox to enable non-technical users to connect, analyze and explore IoT data strea…☆26Updated last year
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 3 years ago
- Sample processing code using Spark 2.1+ and Scala☆51Updated 4 years ago
- Snowflake Connector for Dremio using the ARP SDK.☆16Updated last year
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆96Updated last year
- Wrangler Transform: A DMD system for transforming Big Data☆89Updated last month
- M3D Engine is a Spark application for the development of scalable data transformations and ingestions in data lakes.☆18Updated 3 years ago
- Infrastructure automation to deploy Hadoop,Hive,Spark,airflow nodes on a docker host☆20Updated 5 years ago
- Bootstrap a pipeline on the BDE platform☆26Updated 8 years ago
- HDF masterclass materials☆28Updated 8 years ago