amplab/drizzle-spark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/amplab/drizzle-spark)

amplab / drizzle-spark

Drizzle integration with Apache Spark

☆120

Alternatives and similar repositories for drizzle-spark

Users that are interested in drizzle-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jerryshao / spark-kafka-0-8-sql
View on GitHub
Spark Structured Streaming Kafka 0.8 Source Implementation
☆35Apr 27, 2017Updated 9 years ago
databricks / tensorframes
View on GitHub
[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark
☆744Jul 30, 2024Updated last year
hazelcast / hazelcast-spark
View on GitHub
Spark Connector for Hazelcast
☆22Jun 9, 2021Updated 5 years ago
apache / incubator-crail
View on GitHub
Mirror of Apache crail (Incubating)
☆152Jul 3, 2022Updated 4 years ago
databricks / benchmarks
View on GitHub
A place in which we publish scripts for reproducible benchmarks.
☆105Dec 13, 2019Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ibm-research-ireland / sparkoscope
View on GitHub
Enabling Spark Optimization through Cross-stack Monitoring and Visualization
☆47Aug 23, 2017Updated 8 years ago
hvanhovell / weld-java
View on GitHub
JVM integration for Weld
☆16Sep 24, 2018Updated 7 years ago
h2oai / sparkling-water
View on GitHub
Sparkling Water provides H2O functionality inside Spark cluster
☆979Nov 5, 2025Updated 8 months ago
hortonworks-spark / spark-schema-registry
View on GitHub
Schema Registry integration for Apache Spark
☆40Nov 16, 2022Updated 3 years ago
sequenceiq / docker-spark-native-yarn
View on GitHub
☆13Mar 8, 2018Updated 8 years ago
heronproject / apache-proposal
View on GitHub
Apache Incubator Proposal for Heron
☆22Feb 17, 2016Updated 10 years ago
NetSys / spark-monotasks
View on GitHub
Fast, predictable data analytics based on (and API-compatible with) Apache Spark
☆26Oct 28, 2017Updated 8 years ago
TIBCOSoftware / snappydata
View on GitHub
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…
☆1,032Nov 21, 2022Updated 3 years ago
databricks / spark-deep-learning
View on GitHub
Deep Learning Pipelines for Apache Spark
☆1,989Mar 30, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yahoo / streaming-benchmarks
View on GitHub
Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...
☆647Dec 17, 2023Updated 2 years ago
ikexu / SparkStreaming-Use-OpenCV
View on GitHub
Spark Streaming与OpenCV传感器数据实时获取
☆13Jun 20, 2016Updated 10 years ago
apache / apex-core
View on GitHub
Mirror of Apache Apex core
☆350Jun 7, 2021Updated 5 years ago
dataArtisans / yahoo-streaming-benchmark
View on GitHub
An extension of Yahoo's Benchmarks
☆108Dec 18, 2023Updated 2 years ago
chermenin / spark-states
View on GitHub
Custom state store providers for Apache Spark
☆92Feb 14, 2025Updated last year
cloudera / livy
View on GitHub
Livy is an open source REST interface for interacting with Apache Spark from anywhere
☆1,007Oct 5, 2022Updated 3 years ago
tmatyashovsky / lambda-architecture-jeeconf-kyiv
View on GitHub
Simple Lambda Architecture implementation based on Apache Spark (Core, SQL, Streaming)
☆40Feb 19, 2017Updated 9 years ago
amplab / spark-indexedrdd
View on GitHub
An efficient updatable key-value store for Apache Spark
☆255Mar 11, 2017Updated 9 years ago
databricks / spark-sql-perf
View on GitHub
☆623Feb 26, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ExNexu / drools-scala-example
View on GitHub
☆10Apr 10, 2014Updated 12 years ago
sunilmallya / robocar2017
View on GitHub
Robocar Rally 2017
☆13Jun 5, 2018Updated 8 years ago
uber / RemoteShuffleService
View on GitHub
Remote shuffle service for Apache Spark to store shuffle data on remote servers.
☆335Sep 29, 2023Updated 2 years ago
h2oai / h2o-droplets
View on GitHub
Templates for projects based on top of H2O.
☆38Mar 17, 2025Updated last year
Kyligence / calcite
View on GitHub
a tailored Apache Calcite for Apache Kylin, more details at http://mail-archives.apache.org/mod_mbox/kylin-dev/201704.mbox/%3CCAF7etT=wEB…
☆14Nov 7, 2025Updated 8 months ago
maropu / hivemall-spark
View on GitHub
A Hivemall wrapper for Spark
☆31Apr 21, 2016Updated 10 years ago
hortonworks-spark / spark-llap
View on GitHub
☆102Mar 23, 2020Updated 6 years ago
holdenk / spark-validator
View on GitHub
A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support…
☆111Feb 1, 2018Updated 8 years ago
CodingCat / xgboost4j-spark-scalability
View on GitHub
a benchmark to test scalability of xgboost4j-spark and relevant projects
☆22Dec 20, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
cguegi / azure-databricks-airflow-example
View on GitHub
Example of orchestrating dependent Databricks jobs using Airflow
☆11Dec 19, 2019Updated 6 years ago
amplab / SparkNet
View on GitHub
Distributed Neural Networks for Spark
☆610Jul 23, 2020Updated 6 years ago
mattyb149 / nifi-client
View on GitHub
A NiFi client library for JVM languages
☆13Mar 18, 2016Updated 10 years ago
Cargill / pipewrench
View on GitHub
Data pipeline automation tool
☆28Jan 11, 2024Updated 2 years ago
yahoo / TensorFlowOnSpark
View on GitHub
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
☆3,845Jul 10, 2023Updated 3 years ago
ucbrise / cs294-rise-fa16
View on GitHub
CS294 RISE Course Material
☆32Jan 23, 2019Updated 7 years ago
JeffersonLab / clas12-offline-software
View on GitHub
CLAS12 Offline Software
☆10May 18, 2023Updated 3 years ago