samelamin / spark-bigqueryView external linksLinks
Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
☆70May 8, 2023Updated 2 years ago
Alternatives and similar repositories for spark-bigquery
Users that are interested in spark-bigquery are comparing it to the libraries listed below
Sorting:
- Google BigQuery support for Spark, SQL, and DataFrames☆156Dec 14, 2019Updated 6 years ago
- ☆31Oct 17, 2018Updated 7 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Oct 12, 2016Updated 9 years ago
- A handy Scala wrapper of Google BigQuery API 's Java Client Library.☆34Sep 29, 2018Updated 7 years ago
- A minimalist klout API interface.☆23Jan 21, 2018Updated 8 years ago
- Hadoop InputFormat for http://druid.io/☆10Oct 26, 2016Updated 9 years ago
- Helm Chart for lyft/flinkk8soperator☆11Mar 10, 2020Updated 5 years ago
- Google BigQuery API using service account credentials.☆21Feb 22, 2016Updated 9 years ago
- IPython magics to work with DBT☆15Jul 22, 2022Updated 3 years ago
- Minitime - a Java Time wrapper for Scala and Scala.js☆16Jan 17, 2020Updated 6 years ago
- Postgres extension drivers for quill☆15Oct 31, 2016Updated 9 years ago
- BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.☆420Feb 4, 2026Updated last week
- Spark Structured Streaming State Tools☆34Jul 3, 2020Updated 5 years ago
- Recipes and examples for Apache Spark☆13Jan 21, 2015Updated 11 years ago
- Bigquery bundle for Apache NiFi☆15Apr 20, 2019Updated 6 years ago
- Writing PySpark logs in Apache Spark and Databricks☆17Jun 13, 2022Updated 3 years ago
- Dockerfile for Apache Zeppelin☆17Dec 9, 2015Updated 10 years ago
- Hive Storage Handler for interoperability between BigQuery and Apache Hive☆19Jan 29, 2025Updated last year
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit☆24Jun 22, 2014Updated 11 years ago
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub☆37Feb 13, 2018Updated 8 years ago
- Coursera Machine Learning class examples in Spark☆43Feb 14, 2014Updated 12 years ago
- BigQuery Schema Conversion Tool☆23Oct 6, 2020Updated 5 years ago
- Sparklyr extension package to connect to Google BigQuery☆19Oct 29, 2024Updated last year
- ☆45Apr 27, 2020Updated 5 years ago
- Library for organizing batch processing pipelines in Apache Spark☆42Jan 4, 2017Updated 9 years ago
- Apache Calcite Adapter for Apache Kudu☆28Sep 26, 2025Updated 4 months ago
- Spark data profiling utilities☆22Nov 24, 2018Updated 7 years ago
- Spark DataFrames for earth observation data☆19May 1, 2018Updated 7 years ago
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and…☆28Mar 20, 2023Updated 2 years ago
- Discover Flink clusters on Hadoop YARN for Prometheus☆23Aug 5, 2020Updated 5 years ago
- Replicates data between Google Cloud BigQuery projects☆22Jul 13, 2016Updated 9 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆23May 7, 2018Updated 7 years ago
- A minimal seed template for an Akka gRPC with Scala build☆19Jan 22, 2026Updated 3 weeks ago
- Flink Controller implements a Kubernetes Custom Controller (aka Kubernetes Operator) for Apache Flink☆52Jan 26, 2026Updated 2 weeks ago
- A SBT resolver and publisher for Google Cloud Storage☆23Dec 15, 2021Updated 4 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Apr 18, 2017Updated 8 years ago
- Visual Studio Code extension for GoogleSQL☆24Feb 1, 2026Updated last week
- ACID Data Source for Apache Spark based on Hive ACID☆96Jul 7, 2021Updated 4 years ago
- doobie integration with quill☆22Jul 8, 2019Updated 6 years ago