Mirror of Apache Samoa (Incubating)
☆251Apr 16, 2023Updated 2 years ago
Alternatives and similar repositories for incubator-samoa
Users that are interested in incubator-samoa are comparing it to the libraries listed below
Sorting:
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.☆428Mar 28, 2016Updated 9 years ago
- Stream Data Mining Library for Spark Streaming☆500Apr 16, 2023Updated 2 years ago
- Distributed, streaming anomaly detection and prediction with HTM in Apache Flink☆136Sep 6, 2017Updated 8 years ago
- Mirror of Apache MRQL (Incubating)☆17Aug 22, 2017Updated 8 years ago
- Mirror of Apache Apex core☆350Jun 7, 2021Updated 4 years ago
- An open source ML system for the end-to-end data science lifecycle☆1,079Feb 20, 2026Updated last week
- Trident-ML : A realtime online machine learning library☆384Dec 16, 2023Updated 2 years ago
- Scalable real-time stream mining on Twitter Public Stream using SAMOA☆14Dec 15, 2014Updated 11 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner)☆88Jul 7, 2016Updated 9 years ago
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...☆646Dec 17, 2023Updated 2 years ago
- MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regr…☆653Dec 19, 2025Updated 2 months ago
- ☆13Nov 2, 2017Updated 8 years ago
- Parameter Server implementation in Apache Flink.☆14Mar 15, 2018Updated 7 years ago
- Mirror of Apache Apex malhar☆133Nov 13, 2019Updated 6 years ago
- A library for time series analysis on Apache Spark☆1,196Oct 13, 2020Updated 5 years ago
- Cascading on Apache Flink®☆54Feb 5, 2024Updated 2 years ago
- JVM integration for Weld☆16Sep 24, 2018Updated 7 years ago
- Jetstream is a streaming processing framework☆115Sep 16, 2015Updated 10 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,260Feb 19, 2026Updated last week
- Apache Spark OpenCPU Executor (ROSE)☆26Jun 16, 2018Updated 7 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆30Feb 1, 2016Updated 10 years ago
- Mirror of Apache Toree (Incubating)☆749Feb 21, 2026Updated last week
- Reusable code for Hive☆16Aug 19, 2014Updated 11 years ago
- Helm Chart for lyft/flinkk8soperator☆11Mar 10, 2020Updated 5 years ago
- Deeplearning framework running on Spark☆61Dec 16, 2023Updated 2 years ago
- Deterministic transactional database layer on top of a stream processing engine☆27Oct 27, 2019Updated 6 years ago
- Generic driver for LDBC Graphalytics implementation☆87Dec 19, 2024Updated last year
- Spark MLlib code optimized to efficiently support sparse data☆51Dec 22, 2016Updated 9 years ago
- Online machine learning algorithms based on Spark streaming☆12Nov 30, 2015Updated 10 years ago
- ☆110Apr 17, 2017Updated 8 years ago
- Realtime analytics, this includes the core components of Pulsar pipeline.☆651Nov 6, 2015Updated 10 years ago
- An experimental Graph Streaming API for Apache Flink☆141Oct 13, 2020Updated 5 years ago
- Mirror of Apache Edgent (Incubating)☆225Nov 1, 2019Updated 6 years ago
- Distributed Prometheus time series database☆1,462Updated this week
- Spark GPU and SIMD Support☆61Jul 22, 2020Updated 5 years ago
- Public Presentations☆24Apr 13, 2025Updated 10 months ago
- A web-latency SQL spout for Hadoop.☆50Jan 25, 2021Updated 5 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,037Nov 21, 2022Updated 3 years ago