Spark pipelines that correspond to a series of Dataflow examples.
☆27 · May 5, 2019 · Updated 7 years ago
Alternatives and similar repositories for spark-examples
Users that are interested in spark-examples are comparing it to the libraries listed below.
- Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines. This re… ☆166 · Jul 25, 2018 · Updated 7 years ago
- Processing Logs at Scale using Cloud Dataflow ☆62 · Mar 18, 2019 · Updated 7 years ago
- Data Science with Apache Spark and Spark Notebook ☆30 · Jul 24, 2017 · Updated 8 years ago
- Install JupyterHub on Google Cloud ☆17 · Aug 7, 2017 · Updated 8 years ago
- Thoughts on things I find interesting. ☆17 · Dec 19, 2024 · Updated last year
- PostgreSQL and GreenPlum Data Source for Apache Spark ☆35 · May 6, 2026 · Updated 2 weeks ago
- ☆26 · Feb 28, 2013 · Updated 13 years ago
- Java chat example app ☆11 · Mar 11, 2022 · Updated 4 years ago
- This repository is deprecated. All of its content and history has been moved to googleapis/google-cloud-node. ☆13 · Jul 20, 2023 · Updated 2 years ago
- Shared resources of db-migrate. ☆13 · Mar 31, 2023 · Updated 3 years ago
- Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines. ☆164 · May 31, 2017 · Updated 8 years ago
- A Gentle introduction to Machine Learning with Apache Spark ☆11 · Mar 2, 2026 · Updated 2 months ago
- DataSphere product documentation ☆12 · Sep 25, 2019 · Updated 6 years ago
- Deprecated Module: See Xponents or OpenSextantToolbox as active code base. ☆31 · Jul 24, 2013 · Updated 12 years ago
- ☆84 · Jan 26, 2026 · Updated 3 months ago
- Sample how to use Camunda DMN decisions in a Zeebe Workflow ☆11 · Apr 13, 2022 · Updated 4 years ago
- Docker-based utility for testing network failures and partitions in distributed applications ☆10 · Oct 4, 2016 · Updated 9 years ago
- A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML ☆15 · Dec 24, 2016 · Updated 9 years ago
- The goal is to pilot Microsoft Cognitive Services to unlock the strategic value of UN unstructured content by building on AI and semantic… ☆16 · Jul 6, 2023 · Updated 2 years ago
- giter8 template for Scala projects using sbt ☆39 · Nov 20, 2016 · Updated 9 years ago
- ☆12 · Oct 16, 2023 · Updated 2 years ago
- Akka Java cluster singleton example ☆10 · Dec 5, 2023 · Updated 2 years ago
- Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API ☆14 · Apr 15, 2020 · Updated 6 years ago
- gRPC API definitions for µONOS ☆17 · Sep 9, 2024 · Updated last year
- What's Wrong - simple, quick first step when debugging any server issue ☆52 · Jan 31, 2013 · Updated 13 years ago
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark ☆14 · Apr 14, 2023 · Updated 3 years ago
- A Proteus example using RSocket RPC, and Kafka ☆19 · Feb 8, 2019 · Updated 7 years ago
- A sample solution that periodically checks the status of SSL proxy load balancers and rotates their certificates from a configured Privat… ☆16 · Jun 29, 2021 · Updated 4 years ago
- Link GitHub issues to Cases in Salesforce1 ☆13 · Dec 12, 2016 · Updated 9 years ago
- Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration. ☆70 · May 8, 2023 · Updated 3 years ago
- Published data models for the Hercules vendor-agnostic SDN switch ☆12 · Aug 15, 2020 · Updated 5 years ago
- Repo for various Kubernetes applications ☆18 · Dec 29, 2016 · Updated 9 years ago
- ☆21 · Updated this week
- Labs for the OReilly Training "Process Automation in Modern Architectures" ☆12 · Jul 15, 2021 · Updated 4 years ago
- Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner) ☆88 · Jul 7, 2016 · Updated 9 years ago
- Make the Guice EDSL more Scala friendly ☆45 · Oct 26, 2017 · Updated 8 years ago
- ☆15 · Mar 1, 2019 · Updated 7 years ago
- Example to create lineage in Atlas with sqoop and spark ☆14 · Apr 5, 2017 · Updated 9 years ago
- Tools for faster and optimized interaction with Teradata and large datasets. ☆17 · Jul 11, 2018 · Updated 7 years ago