The iterative broadcast join example code.
☆71 · Oct 23, 2017 · Updated 8 years ago
Alternatives and similar repositories for iterative-broadcast-join
Users that are interested in iterative-broadcast-join are comparing it to the libraries listed below.
- Data Exploration Using Spark 2.0 ☆14 · Apr 17, 2018 · Updated 8 years ago
- Minimal starter for using React + PostCSS with Webpack. ☆17 · Feb 5, 2019 · Updated 7 years ago
- Spark cloud integration: tests, cloud committers and more ☆20 · Jan 30, 2025 · Updated last year
- ☆26 · Feb 28, 2013 · Updated 13 years ago
- Spark, Spark Streaming and Spark SQL unit testing strategies ☆215 · Oct 12, 2016 · Updated 9 years ago
- Project for the talk on NLP using LSTM implementation from DL4J on Spark ☆20 · May 6, 2016 · Updated 10 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark ☆16 · Jan 22, 2024 · Updated 2 years ago
- A library for exporting Spark ML models and pipelines to PFA ☆55 · Nov 21, 2018 · Updated 7 years ago
- Utilities for writing tests that use Apache Spark. ☆24 · Dec 29, 2018 · Updated 7 years ago
- Base classes to use when writing tests with Spark ☆1,555 · Apr 20, 2026 · Updated 2 weeks ago
- Scala solutions for hackerrank ☆11 · Nov 20, 2016 · Updated 9 years ago
- Spark 3.0.0 Structured Streaming Kafka Avro Demo ☆15 · Apr 21, 2023 · Updated 3 years ago
- Spark-Transformers: Library for exporting Apache Spark MLLIB models to use them in any Java application with no other dependencies. ☆42 · Dec 15, 2017 · Updated 8 years ago
- Gradle plugin to build a project against multiple versions of Scala ☆31 · Oct 25, 2024 · Updated last year
- ☆23 · Apr 22, 2019 · Updated 7 years ago
- Scala utility to send mail ☆14 · May 4, 2020 · Updated 6 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization ☆47 · Aug 23, 2017 · Updated 8 years ago
- Examples of Spark 3.0 ☆45 · Nov 11, 2020 · Updated 5 years ago
- Essential Spark extensions and helper methods ✨😲 ☆766 · Sep 14, 2025 · Updated 7 months ago
- Working example of consuming Avro data from Kafka with Spark Streaming ☆12 · Feb 21, 2016 · Updated 10 years ago
- Apache Spark 2x Machine Learning Cookbook, published by Packt ☆33 · Jul 23, 2025 · Updated 9 months ago
- ☆21 · Jan 16, 2015 · Updated 11 years ago
- ☆104 · Nov 26, 2019 · Updated 6 years ago
- something to help you spark ☆65 · Oct 23, 2018 · Updated 7 years ago
- Spark Plugin for Amazon S3 ☆13 · Feb 3, 2016 · Updated 10 years ago
- Loads LDBC social graph data into Flink DataSets ☆10 · Sep 25, 2024 · Updated last year
- Giter8 template of a Udash application. ☆19 · Mar 21, 2023 · Updated 3 years ago
- ☆11 · Apr 27, 2020 · Updated 6 years ago
- Examples of diagrams using Mermaid: https://mermaid.js.org/intro/ ☆12 · Mar 25, 2023 · Updated 3 years ago
- S2RDF (SPARQL on Spark for RDF) is a SPARQL query processor for Hadoop based on Spark SQL. It uses the relational interface of Spark for … ☆13 · Apr 21, 2018 · Updated 8 years ago
- Examples of Spark 2.0 ☆214 · Aug 11, 2021 · Updated 4 years ago
- Paxos protocol in Akka ☆25 · Jan 31, 2016 · Updated 10 years ago
- A systematic Benchmarking on the performance of Spark-SQL for processing Vast RDF datasets ☆14 · Jun 29, 2022 · Updated 3 years ago
- Using Spark SQLContext, HiveContext & Spark DataFrames API with ElasticSearch, Cassandra & MongoDB ☆22 · Sep 13, 2016 · Updated 9 years ago
- calcite-arrow-sample(WIP) ☆13 · Dec 17, 2017 · Updated 8 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments. ☆24 · Jul 7, 2016 · Updated 9 years ago
- ☆12 · Sep 20, 2023 · Updated 2 years ago
- Scala port of the word2vec toolkit. ☆11 · Aug 15, 2016 · Updated 9 years ago
- Accelerating SPARQL Queries by Exploiting Hash-based Locality and Adaptive Partitioning ☆10 · Jan 21, 2016 · Updated 10 years ago