rapidsai/spark-examples

[ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples

☆72

Alternatives and similar repositories for spark-examples

Users who are interested in spark-examples are comparing it to the libraries listed below.

rapidsai / xgboost
Fork of dmlc/xgboost for RAPIDS + XGBoost integration
☆29 · May 19, 2023 · Updated 3 years ago
alvintoh / udemy-hands-on-hadoop
AlvinToh Learning Repository for The Ultimate Hands-On Hadoop - Tame your Big Data!
☆10 · May 23, 2018 · Updated 8 years ago
NVIDIA / nvidia-gcp-samples
NVIDIA GPU Accelerated Application Samples in Google Cloud Platform
☆23 · Jul 16, 2026 · Updated last week
jacobtomlinson / jupyterlab-nvdashboard
A JupyterLab extension for displaying dashboards of GPU usage.
☆13 · Aug 24, 2023 · Updated 2 years ago
talperetz / awesome-gradient-boosting
A curated list of Gradient Boosting resources for Data Scientists
☆16 · Jan 18, 2019 · Updated 7 years ago
databrickslabs / automl-toolkit
Toolkit for Apache Spark ML for Feature clean-up, feature Importance calculation suite, Information Gain selection, Distributed SMOTE, Mo…
☆191 · Jun 1, 2021 · Updated 5 years ago
cne1x / sfcs
Space-Filling Curves in Scala
☆26 · Aug 25, 2020 · Updated 5 years ago
jaceklaskowski / spark-delta-lake-workshop
Spark and Delta Lake Workshop
☆22 · Jun 14, 2022 · Updated 4 years ago
eed3si9n / sudori
☆12 · Nov 12, 2021 · Updated 4 years ago
yassineAlouini / airbus_ship_detection
The Airbus ship detection Kaggle challenge personal attempt
☆15 · Nov 10, 2018 · Updated 7 years ago
criteo / mlflow-yarn
Backend implementation for running MLFlow projects on Hadoop/YARN.
☆11 · Dec 27, 2022 · Updated 3 years ago
agrippa / spark-swat
Automatic offload of user-written Spark kernels to accelerators
☆18 · Oct 25, 2016 · Updated 9 years ago
marcbux / Hi-WAY
Heterogeneity-incorporating Workflow ApplicationMaster for YARN
☆26 · Oct 31, 2017 · Updated 8 years ago
akshaywadia / graphMod
☆23 · Feb 23, 2015 · Updated 11 years ago
nilan3 / local-aws-spark-zeppelin-stack
AWS LocalStack + Spark Cluster + Zeppelin [Docker]
☆10 · Jul 6, 2022 · Updated 4 years ago
Lancern / cache-coherence-protocol-bench
Benchmarking code for evaluating the cost of cache coherence protocols implemented on different platforms
☆14 · Apr 13, 2021 · Updated 5 years ago
Primetalk / typed-ontology
A unique fusion of ontology ideas, strong Scala type system and Json flexibility
☆20 · Oct 15, 2025 · Updated 9 months ago
Mellanox / ipoib-cni
IP Over Infiniband (IPoIB) CNI Plugin
☆18 · Updated this week
quiltdata / examples
☆12 · Oct 24, 2025 · Updated 9 months ago
ExpediaGroup / datasqueeze
Hadoop utility to compact small files
☆18 · Feb 16, 2026 · Updated 5 months ago
mouryar / spring_hive_jdbc_template
☆10 · Feb 10, 2017 · Updated 9 years ago
hazelcast / big-data-benchmark
☆14 · Jun 30, 2026 · Updated 3 weeks ago
emer / auditory
Neural network auditory processing code in Go focused on filtering speech wav files via mel filters
☆11 · Jan 22, 2024 · Updated 2 years ago
edtk / vagrant-box
📦 Starting box for Vagrant. Inside box Ubuntu 20.04 LTS with Git, Docker and Docker compose.
☆19 · May 5, 2022 · Updated 4 years ago
MrBr-github / lshca
☆13 · Mar 3, 2025 · Updated last year
sparklingpandas / sparklingml
Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)
☆73 · Nov 9, 2023 · Updated 2 years ago
radanalyticsio / silex
something to help you spark
☆65 · Oct 23, 2018 · Updated 7 years ago
audit4j / audit4j-microservice
Language independent and centralized auditing server as a microservice.
☆10 · May 3, 2018 · Updated 8 years ago
TU-Berlin-DIMA / grizzly-prototype
Grizzly: Efficient Stream Processing Through Adaptive Query Compilation
☆17 · Jun 13, 2020 · Updated 6 years ago
apache / incubator-crail
Mirror of Apache crail (Incubating)
☆152 · Jul 3, 2022 · Updated 4 years ago
pfent / L5RDMA
A low level, low latency library, which can be used to accelerate network messages using shared memory and RDMA
☆78 · Dec 7, 2020 · Updated 5 years ago
BowenforGit / GPU-Joins-Evaluation
Evaluate state-of-the-art GPU joins
☆14 · Nov 29, 2023 · Updated 2 years ago
SAITPublic / PNMLibrary
SW Library for Samsung PNM (including functional simulator)
☆11 · Nov 2, 2023 · Updated 2 years ago
jsuereth / shady-side
Prototype Scala -> GLSL translation, including scaffolding to run + test
☆21 · May 14, 2020 · Updated 6 years ago
phatak-dev / Statistical-Data-Exploration-Using-Spark-2.0
Data Exploration Using Spark 2.0
☆14 · Apr 17, 2018 · Updated 8 years ago
dlebrero / kafka-streams-and-ktable-example
☆13 · Jul 24, 2017 · Updated 9 years ago
NVIDIA / cudf-spark
NVIDIA cuDF for Apache Spark plugin - accelerate Apache Spark with GPUs
☆991 · Updated this week
WhiteFangBuck / CDSW-DL
Set up tools for running a few DL libraries on CDH and CDSW
☆17 · Jul 23, 2020 · Updated 6 years ago
openscoring / openscoring-docker
Openscoring application for the Docker distributed applications platform
☆11 · Nov 8, 2020 · Updated 5 years ago