theShadow89 / nifi-bigquery-bundleLinks
Bigquery bundle for Apache NiFi
☆15Updated 6 years ago
Alternatives and similar repositories for nifi-bigquery-bundle
Users that are interested in nifi-bigquery-bundle are comparing it to the libraries listed below
Sorting:
- Google BigQuery support for Spark, SQL, and DataFrames☆155Updated 6 years ago
- Snowflake Data Source for Apache Spark.☆230Updated 2 weeks ago
- Loads Snowplow enriched events into Google BigQuery☆22Updated 8 months ago
- Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.☆70Updated 2 years ago
- Spark connector for SFTP☆98Updated 2 years ago
- Examples for High Performance Spark☆16Updated 2 months ago
- Hive Storage Handler for interoperability between BigQuery and Apache Hive☆19Updated 11 months ago
- BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.☆413Updated 2 weeks ago
- Spark data source for Salesforce☆81Updated last year
- Kinesis Connector for Structured Streaming☆137Updated last year
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆91Updated last year
- JSON schema parser for Apache Spark☆82Updated 3 years ago
- Spark package for checking data quality☆222Updated 5 years ago
- ☆31Updated 7 years ago
- A COBOL parser and Mainframe/EBCDIC data source for Apache Spark☆158Updated last week
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub☆37Updated 7 years ago
- The iterative broadcast join example code.☆70Updated 8 years ago
- A rough prototype of a tool for discovering Apache Hive schemas from JSON documents.☆42Updated 2 years ago
- Benchmark data warehouses under Fivetran-like conditions☆172Updated 3 years ago
- Cloud Dataproc: Samples and Utils☆206Updated 3 weeks ago
- Google BigQuery data source for Apache Spark☆17Updated last year
- An application that uses Cloud Dataflow and Cloud Build to copy/transfer BigQuery tables between locations/regions.☆14Updated 4 years ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.☆47Updated 9 years ago
- Data ingestion library for Amundsen to build graph and search index☆204Updated last year
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive☆184Updated 2 months ago
- Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.☆289Updated this week
- type-class based data cleansing library for Apache Spark SQL☆78Updated 6 years ago
- ☆66Updated last year
- File compaction tool that runs on top of the Spark framework.☆59Updated 6 years ago
- Sample code with integration between Data Catalog and Hive data source.☆24Updated 11 months ago