samelamin/spark-bigquery

Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.

☆70

Alternatives and similar repositories for spark-bigquery

Users who are interested in spark-bigquery are comparing it to the libraries listed below.

spotify / spark-bigquery
Google BigQuery support for Spark, SQL, and DataFrames
☆156 · Dec 14, 2019 · Updated 6 years ago
seratch / bigquery4s
A handy Scala wrapper of Google BigQuery API's Java Client Library.
☆34 · Sep 29, 2018 · Updated 7 years ago
polleyg / gcp-dataflow-copy-bigquery
An application that uses Cloud Dataflow and Cloud Build to copy/transfer BigQuery tables between locations/regions.
☆14 · Mar 17, 2021 · Updated 5 years ago
miraisolutions / sparkbq
Sparklyr extension package to connect to Google BigQuery
☆19 · Oct 29, 2024 · Updated last year
univalence / spark-tools
☆46 · Apr 27, 2020 · Updated 6 years ago
calrissian / spark-jetty-server
Recipes and examples for Apache Spark
☆13 · Jan 21, 2015 · Updated 11 years ago
memsql / streamliner-examples
Example code for building your own MemSQL Streamliner Pipelines
☆23 · Apr 18, 2017 · Updated 9 years ago
HeartSaVioR / spark-state-tools
Spark Structured Streaming State Tools
☆34 · Jul 3, 2020 · Updated 6 years ago
GoogleCloudPlatform / spark-on-k8s-gcp-examples
Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub
☆35 · Feb 13, 2018 · Updated 8 years ago
GoogleCloudDataproc / spark-bigquery-connector
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
☆425 · Updated this week
pishen / minitime
Minitime - a Java Time wrapper for Scala and Scala.js
☆16 · Jan 17, 2020 · Updated 6 years ago
nevillelyh / shapeless-datatype
Shapeless utilities for common data types
☆67 · Jul 2, 2026 · Updated 2 weeks ago
dhwajraj / spark-twitter-named-entity
Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP
☆14 · Oct 12, 2016 · Updated 9 years ago
mdedetrich / quill-pg
Postgres extension drivers for quill
☆14 · Oct 31, 2016 · Updated 9 years ago
bolcom / hive_compared_bq
hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.
☆27 · Dec 13, 2017 · Updated 8 years ago
uswitch / big-replicate
Replicates data between Google Cloud BigQuery projects
☆22 · Jul 13, 2016 · Updated 10 years ago
GoogleCloudDataproc / hive-bigquery-storage-handler
Hive Storage Handler for interoperability between BigQuery and Apache Hive
☆19 · Jan 29, 2025 · Updated last year
lightbend / sbt-google-cloud-storage
A SBT resolver and publisher for Google Cloud Storage
☆23 · Dec 15, 2021 · Updated 4 years ago
theShadow89 / nifi-bigquery-bundle
Bigquery bundle for Apache NiFi
☆15 · Apr 20, 2019 · Updated 7 years ago
implydata / druid-hadoop-inputformat
Hadoop InputFormat for http://druid.io/
☆10 · Oct 26, 2016 · Updated 9 years ago
nezihyigitbasi / FlinkParquet
Using the Parquet file format (with Avro) to process data with Apache Flink
☆14 · Aug 17, 2015 · Updated 10 years ago
jasonsatran / spark-meta
Spark data profiling utilities
☆23 · Nov 24, 2018 · Updated 7 years ago
codehaus / jcsp
Read-only mirror of https://xircles.codehaus.org/projects/jcsp/repos/primary/repo
☆13 · Jun 30, 2014 · Updated 12 years ago
debasishg / tradeio
A disciplined way to purely functional domain models in Scala
☆30 · Aug 12, 2024 · Updated last year
lightbend / flink-operator
Helm Chart for lyft/flinkk8soperator
☆11 · Mar 10, 2020 · Updated 6 years ago
google-marketing-solutions / argon
Campaign Manager 360 and Display & Video 360 Reports to BigQuery connector
☆37 · Apr 18, 2023 · Updated 3 years ago
skinny-framework / skinny-splash
Make your Spray applications simpler with Skinny components
☆17 · Aug 20, 2016 · Updated 9 years ago
twilio / calcite-kudu
Apache Calcite Adapter for Apache Kudu
☆28 · Sep 26, 2025 · Updated 9 months ago
google-marketing-solutions / bqflow
☆31 · Mar 7, 2025 · Updated last year
yu-iskw / spark-streaming-with-google-cloud-example
an example of integrating Spark Streaming with Google Pub/Sub and Google Datastore
☆16 · Mar 22, 2017 · Updated 9 years ago
praetorian-inc / gcloud-lockdown
Scripts to demonstrate VPC Service Controls between tenant and shared projects
☆12 · Jun 11, 2019 · Updated 7 years ago
thesamet / sparksql-scalapb-test
Test for SparkSQL ScalaPB
☆14 · Jun 28, 2022 · Updated 4 years ago
itspawanbhardwaj / spark-fuzzy-matching
Fuzzy matching function in spark (https://spark-packages.org/package/itspawanbhardwaj/spark-fuzzy-matching)
☆24 · Dec 30, 2019 · Updated 6 years ago
staroids / universe
⭐️ Staroid Universe project registry
☆12 · Mar 26, 2021 · Updated 5 years ago
ColCarroll / working_ml
Examples of applied machine learning
☆13 · Dec 27, 2017 · Updated 8 years ago
googleapis / nodejs-os-login
This repository is deprecated. All of its content and history has been moved to googleapis/google-cloud-node.
☆12 · Jul 13, 2023 · Updated 3 years ago
jeffreyolchovy / sbt-fmpp-resolver
An Apache FreeMarker template resolver for the sbt new command
☆12 · Aug 12, 2017 · Updated 8 years ago
Snowflake-Labs / sfguide-ask-questions-to-your-documents-using-rag-with-snowflake-cortex-search
☆15 · Apr 23, 2025 · Updated last year
globalbiodata / inventory_2022
Public repository for the biodata resource inventory performed in 2022.
☆11 · Nov 25, 2025 · Updated 7 months ago