streamsets / pipeline-library
Pipeline library for StreamSets Data Collector and Transformer
☆32Updated last year
Related projects
Alternatives and complementary repositories for pipeline-library
- A Flink application that demonstrates reading from and writing to Apache Kafka with Apache Flink☆20Updated last year
- ☆27Updated 3 weeks ago
- HDF masterclass materials☆28Updated 8 years ago
- spark-drools tutorials☆16Updated 7 months ago
- Scalable CDC Pattern Implemented using PySpark☆18Updated 5 years ago
- ☆26Updated 4 years ago
- ☆39Updated 5 years ago
- MonitoFi: Health & Performance Monitor for your Apache NiFi☆61Updated last year
- A bridge to Apache Atlas for provenance metadata created in the course of using Apache NiFi☆15Updated last year
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆28Updated 2 weeks ago
- ☆27Updated 9 months ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆71Updated this week
- Apache Spark-based Data Flow (ETL) framework which supports multiple read and write destinations of different types and also supports multiple…☆26Updated 3 years ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Updated last year
- Presto Trino with Apache Hive Postgres metastore☆37Updated 2 months ago
- ☆13Updated last week
- Supplementary material for Building a Modern Data Platform with Snowflake, from Pearson.☆21Updated 2 years ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 3 years ago
- Hadoop/Hive/Spark container to perform CI tests☆11Updated 3 years ago
- Infrastructure automation to deploy Hadoop, Hive, Spark, and Airflow nodes on a Docker host☆20Updated 5 years ago
- Collection of examples integrating NiFi with stream processing frameworks.☆56Updated 8 years ago
- Snowflake Connector for Dremio using the ARP SDK.☆16Updated last year
- Ecosystem website for Apache Flink☆12Updated 9 months ago
- Wrangler Transform: A DMD system for transforming Big Data☆89Updated this week
- An opinionated auto-deployer for the Hortonworks Platform☆34Updated 3 years ago
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines☆17Updated 4 years ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆62Updated 6 months ago
- Examples of Spark 3.0☆47Updated 4 years ago
- Delta Lake Examples☆12Updated 4 years ago