Google BigQuery support for Spark, SQL, and DataFrames
☆155Dec 14, 2019Updated 6 years ago
Alternatives and similar repositories for spark-bigquery
Users that are interested in spark-bigquery are comparing it to the libraries listed below
Sorting:
- Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.☆70May 8, 2023Updated 2 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.☆289Updated this week
- A handy Scala wrapper of Google BigQuery API 's Java Client Library.☆34Sep 29, 2018Updated 7 years ago
- A collection of Apache Parquet add-on modules☆30Feb 12, 2026Updated 2 weeks ago
- Machine learning evaluation database☆24Feb 7, 2018Updated 8 years ago
- A Scala API for Apache Beam and Google Cloud Dataflow.☆2,615Feb 12, 2026Updated 2 weeks ago
- an example of integrating Spark Streaming with Google Pub/Sub and Google Datastore☆17Mar 22, 2017Updated 8 years ago
- Ephemeral Hadoop clusters using Google Compute Platform☆135Mar 31, 2022Updated 3 years ago
- A tool for data sampling, data generation, and data diffing☆345Jan 8, 2026Updated last month
- functionstest☆33Oct 25, 2016Updated 9 years ago
- Base classes to use when writing tests with Spark☆1,549Dec 22, 2025Updated 2 months ago
- Fluent Scala DSL for Google's Cloud Dataflow SDK☆56Aug 2, 2015Updated 10 years ago
- Spark pipelines that correspond to a series of Dataflow examples.☆27May 5, 2019Updated 6 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.☆421Updated this week
- Google BigQuery data source for Apache Spark☆17Oct 1, 2024Updated last year
- A connector for SingleStore and Spark☆162Sep 24, 2025Updated 5 months ago
- Run in all nodes of your cluster before the cluster starts - lets you customize your cluster☆600Jan 23, 2026Updated last month
- ☆54Aug 3, 2017Updated 8 years ago
- Runs JVM closures in Docker containers on Kubernetes☆36Mar 23, 2018Updated 7 years ago
- Examples To Help You Learn Akka☆17May 7, 2019Updated 6 years ago
- ☆84Jan 26, 2026Updated last month
- A Scala feature transformation library for data science and machine learning☆474Feb 7, 2025Updated last year
- Compile-time tools for working with Avros in Scala☆55Dec 10, 2017Updated 8 years ago
- DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector☆152Mar 4, 2024Updated last year
- An SBT plugin for automatically calling Avro code generation and a thin scala wrapper for reading and writing Avro files☆22Mar 8, 2018Updated 7 years ago
- A quick start project for polyaxon☆29Aug 2, 2024Updated last year
- Scala bindings for Bokeh plotting library☆138Oct 11, 2023Updated 2 years ago
- The missing MatPlotLib for Scala + Spark☆731Jan 30, 2022Updated 4 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Jun 1, 2015Updated 10 years ago
- Shaded version of Apache Hadoop 2.x for Presto☆16Sep 16, 2025Updated 5 months ago
- A gulp plugin that makes it easy to replace latex equations in a markdown file with rendered images☆11Jul 24, 2015Updated 10 years ago