adobe-research/spindle

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/adobe-research/spindle)

adobe-research / spindle

Next-generation web analytics processing with Scala, Spark, and Parquet.

☆330

Alternatives and similar repositories for spindle

Users that are interested in spindle are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

brightcove-archive / ooyala_spark-jobserver
View on GitHub
REST job server for Spark. Note that this is *not* the mainline open source version. For that, go to https://github.com/spark-jobserver…
☆345May 19, 2017Updated 9 years ago
twitter / summingbird
View on GitHub
Streaming MapReduce with Scalding and Storm
☆2,123Jan 19, 2022Updated 4 years ago
thunderain-project / thunderain
View on GitHub
A Real-Time Analytical Processing (RTAP) example using Spark/Shark
☆51Feb 21, 2014Updated 12 years ago
twitter / storehaus
View on GitHub
Storehaus is a library that makes it easy to work with asynchronous key value stores
☆465Jul 17, 2020Updated 6 years ago
Banno / samza-mesos
View on GitHub
This project allows to run Samza jobs on Mesos cluster
☆43Mar 25, 2021Updated 5 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
tresata / ganitha
View on GitHub
scalding powered machine learning
☆109Nov 18, 2014Updated 11 years ago
twitter / bijection
View on GitHub
Reversible conversions between types
☆656Nov 22, 2024Updated last year
MasseGuillaume / ScalaKata
View on GitHub
Moved
☆118Oct 14, 2019Updated 6 years ago
krasserm / akka-analytics
View on GitHub
Large-scale event processing with Akka Persistence and Apache Spark
☆271Jun 18, 2016Updated 10 years ago
filodb / FiloDB
View on GitHub
Distributed Prometheus time series database
☆1,468Updated this week
twitter / chill
View on GitHub
Scala extensions for the Kryo serialization library
☆618Aug 19, 2024Updated last year
tresata / spark-scalding
View on GitHub
Use Cascading Taps and Scalding DSL with Spark
☆49Dec 28, 2016Updated 9 years ago
saddle / saddle
View on GitHub
SADDLE: Scala Data Library
☆508Mar 21, 2020Updated 6 years ago
hbutani / spark-druid-olap
View on GitHub
Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…
☆281Aug 3, 2018Updated 7 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
alaiacano / dsp-scalding
View on GitHub
Code from my talk on Digital Signal Processing in Hadoop with Scalding
☆15Oct 17, 2013Updated 12 years ago
Bridgewater / scala-notebook
View on GitHub
Interactive Scala REPL in a browser
☆739May 18, 2022Updated 4 years ago
amplab / keystone
View on GitHub
Simplifying robust end-to-end machine learning on Apache Spark.
☆473Apr 18, 2017Updated 9 years ago
mandubian / zpark-ztream
View on GitHub
Driving Spark stream with Scalaz-Stream
☆26Mar 18, 2014Updated 12 years ago
patriknw / akka-data-replication
View on GitHub
Replication of CRDTs in Akka Cluster
☆215Sep 30, 2015Updated 10 years ago
amplab / MLI
View on GitHub
An API for Distributed Machine Learning
☆156Sep 22, 2016Updated 9 years ago
sryza / spark-timeseries
View on GitHub
A library for time series analysis on Apache Spark
☆1,197Oct 13, 2020Updated 5 years ago
memsql / streamliner-starter
View on GitHub
Starter project for building MemSQL Streamliner Pipelines
☆32Apr 18, 2017Updated 9 years ago
tuplejump / embedded-kafka
View on GitHub
Embedded Kafka for testing and quick prototyping.
☆14Apr 19, 2016Updated 10 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
mighdoll / sparkle
View on GitHub
visualization server
☆138Oct 13, 2015Updated 10 years ago
VeritoneAlpha / jaws-spark-sql-rest
View on GitHub
☆91Apr 17, 2017Updated 9 years ago
YahooArchive / samoa
View on GitHub
SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.
☆427Mar 28, 2016Updated 10 years ago
brndnmtthws / kafka-on-marathon
View on GitHub
Scripts for running Apache Kafka on Mesosphere's Marathon
☆14Dec 6, 2015Updated 10 years ago
twitter-archive / ambrose
View on GitHub
A platform for visualization and real-time monitoring of data workflows
☆1,170Jan 22, 2020Updated 6 years ago
NICTA / scoobi
View on GitHub
A Scala productivity framework for Hadoop.
☆479Jul 1, 2022Updated 4 years ago
twitter / hraven
View on GitHub
hRaven collects run time data and statistics from MapReduce jobs in an easily queryable format
☆129Jan 14, 2022Updated 4 years ago
keenlabs / capillary
View on GitHub
Storm Spout + Kafka State Inspector
☆58Dec 20, 2019Updated 6 years ago
pulsarIO / realtime-analytics
View on GitHub
Realtime analytics, this includes the core components of Pulsar pipeline.
☆650Nov 6, 2015Updated 10 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
amplab-extras / SparkR-pkg
View on GitHub
R frontend for Spark
☆641Jun 10, 2016Updated 10 years ago
khronus / khronus
View on GitHub
A reactive time series database
☆234Jun 5, 2018Updated 8 years ago
massie / spark-parquet-example
View on GitHub
Example project to show how to use Spark to read and write Avro/Parquet files
☆50Aug 21, 2013Updated 12 years ago
twitter / scalding
View on GitHub
A Scala API for Cascading
☆3,522May 28, 2023Updated 3 years ago
softprops / cappi
View on GitHub
the sweetest sbt plugin your microbenchmarks will ever meet
☆17Mar 2, 2019Updated 7 years ago
Comcast / sirius
View on GitHub
A distributed system library for managing application reference data
☆296Feb 28, 2025Updated last year
Hydrospheredata / mist
View on GitHub
Serverless proxy for Spark cluster
☆325Apr 13, 2026Updated 3 months ago