Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
☆70May 8, 2023Updated 2 years ago
Alternatives and similar repositories for spark-bigquery
Users that are interested in spark-bigquery are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆31Oct 17, 2018Updated 7 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Oct 12, 2016Updated 9 years ago
- IPython magics to work with DBT☆15Jul 22, 2022Updated 3 years ago
- An application that uses Cloud Dataflow and Cloud Build to copy/transfer BigQuery tables between locations/regions.☆14Mar 17, 2021Updated 5 years ago
- Google BigQuery API using service account credentials.☆21Feb 22, 2016Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Spark pipelines that correspond to a series of Dataflow examples.☆27May 5, 2019Updated 6 years ago
- Spark Structured Streaming State Tools☆34Jul 3, 2020Updated 5 years ago
- ☆45Apr 27, 2020Updated 5 years ago
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub☆37Feb 13, 2018Updated 8 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Apr 18, 2017Updated 9 years ago
- BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.☆421Apr 12, 2026Updated last week
- Minitime - a Java Time wrapper for Scala and Scala.js☆16Jan 17, 2020Updated 6 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆28Dec 13, 2017Updated 8 years ago
- ☆22Jun 9, 2016Updated 9 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Postgres extension drivers for quill☆15Oct 31, 2016Updated 9 years ago
- Replicates data between Google Cloud BigQuery projects☆22Jul 13, 2016Updated 9 years ago
- Hadoop InputFormat for http://druid.io/☆10Oct 26, 2016Updated 9 years ago
- Singer tap for getting CSV and XLS(X) data out of Amazon S3☆12Feb 12, 2025Updated last year
- Hive Storage Handler for interoperability between BigQuery and Apache Hive☆19Jan 29, 2025Updated last year
- Helm Chart for lyft/flinkk8soperator☆11Mar 10, 2020Updated 6 years ago
- Bigquery bundle for Apache NiFi☆15Apr 20, 2019Updated 6 years ago
- Spark data profiling utilities☆23Nov 24, 2018Updated 7 years ago
- My entry to the Kaggle 2013 StumbleUpon competition. Ranked 4th on the final private leaderboard.☆15Apr 23, 2014Updated 11 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Aug 17, 2015Updated 10 years ago
- Dockerfile for Apache Zeppelin☆17Dec 9, 2015Updated 10 years ago
- Campaign Manager 360 and Display & Video 360 Reports to BigQuery connector☆37Apr 18, 2023Updated 3 years ago
- an example of integrating Spark Streaming with Google Pub/Sub and Google Datastore☆16Mar 22, 2017Updated 9 years ago
- A Singer (https://singer.io) target that writes data to Google BigQuery.☆40Mar 8, 2021Updated 5 years ago
- ☆11Jun 4, 2021Updated 4 years ago
- Tutorials, Examples about Kubeflow Pipeline.☆13Nov 21, 2022Updated 3 years ago
- Test for SparkSQL ScalaPB☆14Jun 28, 2022Updated 3 years ago
- Scripts to demonstrate VPC Service Controls between tenant and shared projects☆12Jun 11, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repository is deprecated. All of its content and history has been moved to googleapis/google-cloud-node.☆13Jul 20, 2023Updated 2 years ago
- Coursera Machine Learning class examples in Spark☆43Feb 14, 2014Updated 12 years ago
- Spark SQL Macros provides a mechanism similar to Spark User-Defined function registration; with the key enhancement being that custom cod…☆16Mar 17, 2021Updated 5 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit☆24Jun 22, 2014Updated 11 years ago
- Connector between Spark and InfluxDB.☆23May 19, 2016Updated 9 years ago
- SVM classifiers built for emotion classification☆10Apr 27, 2016Updated 9 years ago
- Google API client (or one the Discworld, the Ephebian God of Avalanches).☆16Apr 7, 2026Updated last week