Next-generation web analytics processing with Scala, Spark, and Parquet.
☆330Mar 28, 2015Updated 10 years ago
Alternatives and similar repositories for spindle
Users that are interested in spindle are comparing it to the libraries listed below
Sorting:
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Feb 21, 2014Updated 12 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆280Aug 3, 2018Updated 7 years ago
- Large-scale event processing with Akka Persistence and Apache Spark☆273Jun 18, 2016Updated 9 years ago
- Read druid segments from hadoop☆10Jan 18, 2017Updated 9 years ago
- ☆33Jan 9, 2016Updated 10 years ago
- Serverless proxy for Spark cluster☆324Oct 29, 2020Updated 5 years ago
- Low level integration of Spark and Kafka☆130Mar 15, 2018Updated 7 years ago
- Real Time Analytics and Data Pipelines based on Spark Streaming☆531Oct 24, 2019Updated 6 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Oct 20, 2020Updated 5 years ago
- Mirror of Apache Spark☆56Jul 9, 2015Updated 10 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆30Feb 1, 2016Updated 10 years ago
- Dropwizard Metrics Cassandra reporter☆17Apr 18, 2020Updated 5 years ago
- Beyond Piwik Analytics with Scala and Apache Spark☆46Nov 30, 2014Updated 11 years ago
- Distributed Prometheus time series database☆1,462Updated this week
- GPU Acceleration for Apache Spark☆34Aug 24, 2015Updated 10 years ago
- Graph tool for sysadmins, with anomaly detection and redis backend.☆14May 5, 2017Updated 8 years ago
- Scala, DSL, Rules based reactive workflows and Microservices☆14Oct 20, 2025Updated 4 months ago
- Realtime analytics, this includes the core components of Pulsar pipeline.☆651Nov 6, 2015Updated 10 years ago
- IoT - It's the thing you want! And so here's a full-stack demo.☆62Jul 28, 2016Updated 9 years ago
- Dead simple Swagger config for Spring Boot☆17Oct 30, 2014Updated 11 years ago
- Trident State implementation on top of Elasticsearch☆21May 18, 2015Updated 10 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs☆242Mar 26, 2015Updated 10 years ago
- ⛔️ [DEPRECATED] sbt's scala incremental compiler☆303May 3, 2017Updated 8 years ago
- An columnar serializer☆15Feb 26, 2016Updated 10 years ago
- A fork of cascading patterns, but implemented for trident☆71Dec 16, 2023Updated 2 years ago
- Experiments with the GDELT dataset and Cassandra schemas.☆25Feb 9, 2016Updated 10 years ago
- ☆110Apr 17, 2017Updated 8 years ago
- High Performance Kafka Connector for Spark Streaming.Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.…☆636Feb 26, 2022Updated 4 years ago
- Collection of generic Apache Flink operators☆17May 15, 2017Updated 8 years ago
- Mirror of Apache Lens☆62Nov 5, 2019Updated 6 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆29May 15, 2020Updated 5 years ago
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.☆115Mar 5, 2020Updated 5 years ago
- Simple Spark example of generating table stats for use of data quality checks☆28Apr 28, 2017Updated 8 years ago
- Graph algorithms implemented in GraphX and Spark styles☆15Apr 26, 2015Updated 10 years ago
- Enabling queries on compressed data.☆282Dec 16, 2023Updated 2 years ago
- SparkOnHBase☆278Mar 30, 2021Updated 4 years ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,783Aug 16, 2021Updated 4 years ago
- A library for time series analysis on Apache Spark☆1,196Oct 13, 2020Updated 5 years ago