Google BigQuery support for Spark, SQL, and DataFrames
☆156Dec 14, 2019Updated 6 years ago
Alternatives and similar repositories for spark-bigquery
Users that are interested in spark-bigquery are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.☆70May 8, 2023Updated 3 years ago
- an example of integrating Spark Streaming with Google Pub/Sub and Google Datastore☆16Mar 22, 2017Updated 9 years ago
- A handy Scala wrapper of Google BigQuery API 's Java Client Library.☆34Sep 29, 2018Updated 7 years ago
- Hive Storage Handler for interoperability between BigQuery and Apache Hive☆19Jan 29, 2025Updated last year
- Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.☆292Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A collection of Apache Parquet add-on modules☆30May 20, 2026Updated 3 weeks ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆145Jan 26, 2016Updated 10 years ago
- Ephemeral Hadoop clusters using Google Compute Platform☆136Mar 31, 2022Updated 4 years ago
- Google BigQuery data source for Apache Spark☆17Oct 1, 2024Updated last year
- A tool for data sampling, data generation, and data diffing☆349Mar 31, 2026Updated 2 months ago
- Shaded version of Apache Hadoop 2.x for Presto☆16Sep 16, 2025Updated 8 months ago
- Runs JVM closures in Docker containers on Kubernetes☆38Mar 23, 2018Updated 8 years ago
- Machine learning evaluation database☆24Feb 7, 2018Updated 8 years ago
- BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.☆423May 21, 2026Updated 3 weeks ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Base classes to use when writing tests with Spark☆1,553Apr 20, 2026Updated last month
- Run in all nodes of your cluster before the cluster starts - lets you customize your cluster☆596May 31, 2026Updated 2 weeks ago
- Fluent Scala DSL for Google's Cloud Dataflow SDK☆56Aug 2, 2015Updated 10 years ago
- Metrics collection library for Google Dataflow☆13Nov 7, 2018Updated 7 years ago
- A Scala feature transformation library for data science and machine learning☆475Feb 7, 2025Updated last year
- DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector☆152Mar 4, 2024Updated 2 years ago
- AMQP data source for dstream (Spark Streaming)☆26Mar 31, 2022Updated 4 years ago
- Compile-time tools for working with Avros in Scala☆55Dec 10, 2017Updated 8 years ago
- A connector for SingleStore and Spark☆162Jun 4, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆14May 27, 2022Updated 4 years ago
- Scio IDEA plugin☆30Oct 2, 2025Updated 8 months ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆473Apr 18, 2017Updated 9 years ago
- ☆84Jan 26, 2026Updated 4 months ago
- A tool for moving tables from Redshift to BigQuery☆65Jan 20, 2019Updated 7 years ago
- Open source tools for Google Cloud Storage and Databases.☆64May 1, 2024Updated 2 years ago
- Spark pipelines that correspond to a series of Dataflow examples.☆27May 5, 2019Updated 7 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Oct 14, 2015Updated 10 years ago
- [SUNSET] Async Google Pubsub Client☆159Mar 18, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Jun 1, 2015Updated 11 years ago
- ☆14Oct 18, 2020Updated 5 years ago
- Opinion Analysis of News, Threaded Conversations, and User Generated Content☆110Sep 19, 2024Updated last year
- Scala bindings for Bokeh plotting library☆138Oct 11, 2023Updated 2 years ago
- Luigi integration for Google BigQuery☆15Nov 18, 2015Updated 10 years ago
- ☆10Feb 7, 2023Updated 3 years ago
- functionstest☆33Oct 25, 2016Updated 9 years ago