godatadriven/iterative-broadcast-join

The iterative broadcast join example code.

☆71

Alternatives and similar repositories for iterative-broadcast-join

Users who are interested in iterative-broadcast-join are comparing it to the libraries listed below.

phatak-dev / Statistical-Data-Exploration-Using-Spark-2.0
Data Exploration Using Spark 2.0
☆14 · Apr 17, 2018 · Updated 8 years ago
hortonworks-spark / cloud-integration
Spark cloud integration: tests, cloud committers and more
☆20 · Jan 30, 2025 · Updated last year
AbsaOSS / spark-hofs
Scala API for Apache Spark SQL high-order functions
☆15 · Aug 4, 2023 · Updated 2 years ago
mkuthan / example-spark
Spark, Spark Streaming and Spark SQL unit testing strategies
☆215 · Oct 12, 2016 · Updated 9 years ago
CODAIT / aardpfark
A library for exporting Spark ML models and pipelines to PFA
☆55 · Nov 21, 2018 · Updated 7 years ago
saikrishnapujari / Spark-Nested-Data-Parser
Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark
☆16 · Jan 22, 2024 · Updated 2 years ago
fqaiser94 / mse
Make Structs Easy (MSE)
☆18 · Jun 22, 2020 · Updated 6 years ago
hammerlab / spark-tests
Utilities for writing tests that use Apache Spark.
☆24 · Dec 29, 2018 · Updated 7 years ago
holdenk / spark-testing-base
Base classes to use when writing tests with Spark
☆1,555 · Apr 20, 2026 · Updated 3 months ago
NikhilSuthar / Scala-Spark-Mail
Scala utility to send mail
☆14 · May 4, 2020 · Updated 6 years ago
advancedxy / hackerrank
Scala solutions for hackerrank
☆11 · Nov 20, 2016 · Updated 9 years ago
Neuw84 / structured-streaming-avro-demo
Spark 3.0.0 Structured Streaming Kafka Avro Demo
☆15 · Apr 21, 2023 · Updated 3 years ago
seanpquig / confluent-platform-spark-streaming
Working example of consuming Avro data from Kafka with Spark Streaming
☆12 · Feb 21, 2016 · Updated 10 years ago
ADTRAN / gradle-scala-multiversion-plugin
Gradle plugin to build a project against multiple versions of Scala
☆31 · Oct 25, 2024 · Updated last year
saikrishnapujari / Spark-Drools-Integration
☆23 · Apr 22, 2019 · Updated 7 years ago
ldbc / gcore-spark
Implementation of the G-CORE graph query language on Spark
☆15 · Aug 25, 2021 · Updated 4 years ago
mrpowers-io / spark-daria
Essential Spark extensions and helper methods ✨😲
☆767 · Jun 22, 2026 · Updated last month
aschaetzle / Sempala
Sempala is a SPARQL-over-SQL approach to provide interactive-time SPARQL query processing on Hadoop. It stores RDF data in a columnar lay…
☆12 · Sep 4, 2017 · Updated 8 years ago
phatak-dev / spark-3.0-examples
Examples of Spark 3.0
☆44 · Nov 11, 2020 · Updated 5 years ago
ibm-research-ireland / sparkoscope
Enabling Spark Optimization through Cross-stack Monitoring and Visualization
☆47 · Aug 23, 2017 · Updated 8 years ago
PacktPublishing / Apache-Spark-2x-Machine-Learning-Cookbook
Apache Spark 2x Machine Learning Cookbook, published by Packt
☆33 · Jul 23, 2025 · Updated last year
masayuki038 / calcite-arrow-sample
calcite-arrow-sample(WIP)
☆13 · Dec 17, 2017 · Updated 8 years ago
kellrott / spark-gremlin
☆21 · Jan 16, 2015 · Updated 11 years ago
UdashFramework / udash.g8
Giter8 template of a Udash application.
☆19 · Mar 21, 2023 · Updated 3 years ago
soniclavier / bigdata-notebook
☆104 · Nov 26, 2019 · Updated 6 years ago
radanalyticsio / silex
something to help you spark
☆65 · Oct 23, 2018 · Updated 7 years ago
phatak-dev / spark2.0-examples
Examples of Spark 2.0
☆213 · Aug 11, 2021 · Updated 4 years ago
chermenin / spark-states
Custom state store providers for Apache Spark
☆92 · Feb 14, 2025 · Updated last year
DataSystemsGroupUT / SPARKSQLRDFBenchmarking
A systematic Benchmarking on the performance of Spark-SQL for processing Vast RDF datasets
☆14 · Jun 29, 2022 · Updated 4 years ago
hortonworks-spark / spark-schema-registry
Schema Registry integration for Apache Spark
☆40 · Nov 16, 2022 · Updated 3 years ago
KonstantinosX / graphgen-project
A Python wrapper over the GraphGen system
☆38 · Sep 15, 2017 · Updated 8 years ago
rbrush / kite-apps
Prescriptive Applications over Kite and Hadoop
☆12 · Oct 14, 2015 · Updated 10 years ago
cerndb / Hadoop-Profiler
Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.
☆24 · Jul 7, 2016 · Updated 10 years ago
carletes / bazel-python-monorepo
A sample monorepo of several Python libraries and commands, using Bazel as build system
☆13 · Oct 11, 2017 · Updated 8 years ago
jeremybeard / oozieloop
Loops in Oozie
☆10 · Feb 15, 2015 · Updated 11 years ago
carlpulley / docker-compose-testkit
☆19 · Mar 2, 2017 · Updated 9 years ago
Refefer / word2vec-scala
Scala port of the word2vec toolkit.
☆11 · Aug 15, 2016 · Updated 9 years ago
phatak-dev / java-sizeof
Memory consumption estimator for Scala/Java
☆27 · Nov 24, 2014 · Updated 11 years ago
nordyke / akka-streams-examples
☆13 · May 26, 2017 · Updated 9 years ago