theShadow89 / nifi-bigquery-bundle
BigQuery bundle for Apache NiFi
☆15 Updated 6 years ago
Alternatives and similar repositories for nifi-bigquery-bundle
Users who are interested in nifi-bigquery-bundle are comparing it to the repositories listed below:
- Examples for High Performance Spark ☆15 Updated 6 months ago
- Scalable CDC Pattern Implemented using PySpark ☆18 Updated 5 years ago
- A Giter8 template for scio ☆31 Updated 3 months ago
- Docker Image and Kubernetes Configurations for Spark 2.x ☆41 Updated 5 years ago
- CDF Tech Bootcamp ☆9 Updated 5 years ago
- The iterative broadcast join example code. ☆69 Updated 7 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds. ☆88 Updated last year
- JSON schema parser for Apache Spark ☆81 Updated 2 years ago
- Spark pipelines that correspond to a series of Dataflow examples. ☆27 Updated 6 years ago
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark ☆30 Updated 6 months ago
- JDBC driver for Apache Kafka ☆87 Updated 3 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor… ☆67 Updated 2 months ago
- Schema Registry integration for Apache Spark ☆40 Updated 2 years ago
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A… ☆13 Updated 8 years ago
- Flink stream filtering examples ☆19 Updated 8 years ago
- Test your Hive scripts inside your favorite IDE with HiveQLUnit! Increase your developers productivity by testing on all operating system… ☆39 Updated 4 years ago
- HDF masterclass materials ☆28 Updated 9 years ago
- Starter project for building MemSQL Streamliner Pipelines ☆32 Updated 8 years ago
- Spark cloud integration: tests, cloud committers and more ☆19 Updated 3 months ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an… ☆61 Updated 8 months ago
- ☆48 Updated 7 years ago
- Apache Atlas development image for the Rokku project: https://github.com/ing-bank/rokku ☆21 Updated 4 years ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool ☆12 Updated 8 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection ☆18 Updated 8 years ago
- Spark connector for SFTP ☆100 Updated 2 years ago
- Scala SDK for working with Snowplow enriched events in Spark, AWS Lambda, Flink et al. ☆21 Updated 6 months ago
- ☆31 Updated 6 years ago
- Magic to help Spark pipelines upgrade ☆35 Updated 7 months ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster. ☆47 Updated 8 years ago
- Nested array transformation helper extensions for Apache Spark ☆37 Updated last year