godatadriven / iterative-broadcast-joinView external linksLinks
The iterative broadcast join example code.
☆71Oct 23, 2017Updated 8 years ago
Alternatives and similar repositories for iterative-broadcast-join
Users that are interested in iterative-broadcast-join are comparing it to the libraries listed below
Sorting:
- Scala port of the word2vec toolkit.☆11Aug 15, 2016Updated 9 years ago
- Scala solutions for hackerrank☆11Nov 20, 2016Updated 9 years ago
- Data Exploration Using Spark 2.0☆14Apr 17, 2018Updated 7 years ago
- Gradle plugin to build a project against multiple versions of Scala☆31Oct 25, 2024Updated last year
- A basic example of how to read and write streaming data using Apache Spark and Kafka on HDInsight☆13Mar 2, 2023Updated 2 years ago
- Scala API for Apache Spark SQL high-order functions☆14Aug 4, 2023Updated 2 years ago
- code pour les billets "Refactorer Future[Option[T]]" sur☆12Jun 14, 2017Updated 8 years ago
- Make Structs Easy (MSE)☆18Jun 22, 2020Updated 5 years ago
- Spark, Spark Streaming and Spark SQL unit testing strategies☆216Oct 12, 2016Updated 9 years ago
- Giter8 template of a Udash application.☆19Mar 21, 2023Updated 2 years ago
- A javascript based custom slide and build framework for presentations. Many of the Gradle engineers have been using this for their presen…☆35Jul 31, 2015Updated 10 years ago
- Spark 3.0.0 Structured Streaming Kafka Avro Demo☆15Apr 21, 2023Updated 2 years ago
- Project for the talk on NLP using LSTM implementation from DL4J on Spark☆20May 6, 2016Updated 9 years ago
- Paxos protocol in Akka☆24Jan 31, 2016Updated 10 years ago
- ☆20Mar 2, 2017Updated 8 years ago
- A library for exporting Spark ML models and pipelines to PFA☆54Nov 21, 2018Updated 7 years ago
- ☆21Jan 16, 2015Updated 11 years ago
- Spark cloud integration: tests, cloud committers and more☆20Jan 30, 2025Updated last year
- Base classes to use when writing tests with Spark☆1,550Dec 22, 2025Updated last month
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.☆24Jul 7, 2016Updated 9 years ago
- Utilities for writing tests that use Apache Spark.☆24Dec 29, 2018Updated 7 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization☆47Aug 23, 2017Updated 8 years ago
- ☆105Nov 26, 2019Updated 6 years ago
- Joins for skewed datasets in Spark☆57Aug 18, 2017Updated 8 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆94Apr 24, 2017Updated 8 years ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆62Jun 15, 2023Updated 2 years ago
- ☆23Apr 22, 2019Updated 6 years ago
- A testing framework for Presto☆62May 2, 2025Updated 9 months ago
- Apache Spark 2x Machine Learning Cookbook, published by Packt☆33Jul 23, 2025Updated 6 months ago
- Memory consumption estimator for Scala/Java☆26Nov 24, 2014Updated 11 years ago
- something to help you spark☆64Oct 23, 2018Updated 7 years ago
- Scala DSL for Unit-Testing Processing Topologies in Kafka Streams☆186Jan 16, 2021Updated 5 years ago
- SoundCloud Backend Developer Challenge☆25Jan 29, 2017Updated 9 years ago
- Apache Spark Scala utility to track data records during application execution☆11Jun 12, 2023Updated 2 years ago
- Scala utility to send mail☆14May 4, 2020Updated 5 years ago
- The missing MatPlotLib for Scala + Spark☆731Jan 30, 2022Updated 4 years ago
- Plotting/visualization with graphical dataflow analysis.☆35Oct 5, 2023Updated 2 years ago
- This repository has the code from the text and the videos for "Introduction to Programming and Problem Solving using Scala".☆30Feb 11, 2018Updated 8 years ago
- A full example of my blog post regarding Sparks stateful streaming (http://asyncified.io/2016/07/31/exploring-stateful-streaming-with-apa…☆35Jul 30, 2017Updated 8 years ago