Spark pipelines that correspond to a series of Dataflow examples.
☆27 · May 5, 2019 · Updated 7 years ago
Alternatives and similar repositories for spark-examples
Users who are interested in spark-examples are comparing it to the libraries listed below.
- Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines. This re… ☆166 · Jul 25, 2018 · Updated 7 years ago
- Processing Logs at Scale using Cloud Dataflow ☆60 · Mar 18, 2019 · Updated 7 years ago
- Data Science with Apache Spark and Spark Notebook ☆30 · Jul 24, 2017 · Updated 8 years ago
- Install JupyterHub on Google Cloud ☆17 · Aug 7, 2017 · Updated 8 years ago
- Thoughts on things I find interesting. ☆17 · Dec 19, 2024 · Updated last year
- ScalaIO 2014 Workshop ☆25 · Oct 23, 2014 · Updated 11 years ago
- Java chat example app ☆11 · Mar 11, 2022 · Updated 4 years ago
- ☆16 · Jun 27, 2020 · Updated 5 years ago
- This repository is deprecated. All of its content and history has been moved to googleapis/google-cloud-node. ☆12 · Jul 13, 2023 · Updated 2 years ago
- ☆11 · Oct 11, 2022 · Updated 3 years ago
- A Gentle introduction to Machine Learning with Apache Spark ☆11 · Mar 2, 2026 · Updated 3 months ago
- Deprecated Module: See Xponents or OpenSextantToolbox as active code base. ☆31 · Jul 24, 2013 · Updated 12 years ago
- ☆84 · Jan 26, 2026 · Updated 4 months ago
- Sample how to use Camunda DMN decisions in a Zeebe Workflow ☆11 · Apr 13, 2022 · Updated 4 years ago
- ☆12 · Oct 8, 2020 · Updated 5 years ago
- Postgres JSONB Node.js Example using massive.js ☆11 · Nov 7, 2016 · Updated 9 years ago
- A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML ☆15 · Dec 24, 2016 · Updated 9 years ago
- Experimental: Multi-producer Single-consumer Queue ☆12 · Jul 30, 2012 · Updated 13 years ago
- giter8 template for Scala projects using sbt ☆39 · Nov 20, 2016 · Updated 9 years ago
- Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API ☆14 · Apr 15, 2020 · Updated 6 years ago
- What's Wrong - simple, quick first step when debugging any server issue ☆52 · Jan 31, 2013 · Updated 13 years ago
- ☆12 · Sep 22, 2023 · Updated 2 years ago
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark ☆14 · Apr 14, 2023 · Updated 3 years ago
- A Proteus example using RSocket RPC, and Kafka ☆19 · Feb 8, 2019 · Updated 7 years ago
- Basic Spark utilities ☆13 · Feb 20, 2025 · Updated last year
- Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration. ☆70 · May 8, 2023 · Updated 3 years ago
- [Unmaintained] Analyze dependencies of Python libraries ☆13 · Oct 1, 2016 · Updated 9 years ago
- Repo for various Kubernetes applications ☆18 · Dec 29, 2016 · Updated 9 years ago
- ☆11 · Apr 7, 2017 · Updated 9 years ago
- Admin SDK codelab samples ☆15 · Aug 24, 2018 · Updated 7 years ago
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w… ☆16 · May 21, 2026 · Updated 3 weeks ago
- Twitter bot to determine whether an image is a hot dog or not ☆12 · May 17, 2017 · Updated 9 years ago
- Due to lack of resources on how to deploy kafka with simple SASL authentication (just username and password) and how to write producer an… ☆12 · Dec 29, 2021 · Updated 4 years ago
- Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner) ☆88 · Jul 7, 2016 · Updated 9 years ago
- Example to create lineage in Atlas with sqoop and spark ☆14 · Apr 5, 2017 · Updated 9 years ago
- ☆15 · Mar 1, 2019 · Updated 7 years ago
- ☆18 · Dec 20, 2016 · Updated 9 years ago
- Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines. ☆850 · Nov 25, 2020 · Updated 5 years ago
- Demo app for blog post ☆14 · May 29, 2017 · Updated 9 years ago