Starter project for building MemSQL Streamliner Pipelines
☆32 · Apr 18, 2017 · Updated 8 years ago
Alternatives and similar repositories for streamliner-starter
Users who are interested in streamliner-starter are comparing it to the libraries listed below.
- Example code for building your own MemSQL Streamliner Pipelines ☆23 · Apr 18, 2017 · Updated 8 years ago
- Use Cascading Taps and Scalding DSL with Spark ☆49 · Dec 28, 2016 · Updated 9 years ago
- Interactive Audience Analytics with Spark and HyperLogLog ☆55 · Oct 14, 2015 · Updated 10 years ago
- A utility for generating Oozie workflows from a YAML definition ☆49 · Mar 4, 2019 · Updated 6 years ago
- Node.js kafka connect connector for prometheus ☆13 · Dec 7, 2022 · Updated 3 years ago
- Real-time anomaly detection using Kafka, KSQL User Defined Function and a pre-trained model ☆30 · Dec 16, 2023 · Updated 2 years ago
- Sparse feature extraction with Spark ☆30 · Jul 25, 2018 · Updated 7 years ago
- Scala Numerical Optimization library ☆10 · Nov 8, 2017 · Updated 8 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization ☆47 · Aug 23, 2017 · Updated 8 years ago
- Cascading on Apache Flink® ☆54 · Feb 5, 2024 · Updated 2 years ago
- A connector for SingleStore and Spark ☆162 · Sep 24, 2025 · Updated 5 months ago
- Collection of Interesting Algorithms ☆16 · Oct 13, 2020 · Updated 5 years ago
- A genomics pipeline built on top of the GATK Queue framework. Main repository: https://github.com/NationalGenomicsInfrastructure/piper (m… ☆21 · Sep 6, 2016 · Updated 9 years ago
- Library for deep embedding of DSLs based on Scala macros. ☆75 · Jan 12, 2016 · Updated 10 years ago
- A simple implementation of k-means clustering on the Spark cluster computing framework. See http://cs.berkeley.edu/~matei/spark. ☆27 · Apr 9, 2011 · Updated 14 years ago
- Simplifying robust end-to-end machine learning on Apache Spark. ☆475 · Apr 18, 2017 · Updated 8 years ago
- A Ruby client library for CrateDB. ☆30 · Feb 11, 2026 · Updated 2 weeks ago
- Airflow code accompanying blog post. ☆21 · Feb 20, 2019 · Updated 7 years ago
- POC: Spark consumer for bottledwater-pg Kafka Avro topics ☆16 · Aug 20, 2020 · Updated 5 years ago
- Simple Samza Job Using Confluent Platform ☆14 · Apr 14, 2016 · Updated 9 years ago
- Apache Spark Awesome List ☆14 · Apr 17, 2016 · Updated 9 years ago
- Mirror of Apache Apex core ☆350 · Jun 7, 2021 · Updated 4 years ago
- Spark Extension: ML transformers, SQL aggregations, etc. that are missing in Apache Spark ☆146 · Jan 26, 2016 · Updated 10 years ago
- Spark library for doing exploratory data analysis in a scalable way ☆43 · Jan 17, 2016 · Updated 10 years ago
- A library for time series analysis on Apache Spark ☆1,196 · Oct 13, 2020 · Updated 5 years ago
- Scala bindings for Bokeh plotting library ☆138 · Oct 11, 2023 · Updated 2 years ago
- Zen aims to provide the largest scale and the most efficient machine learning platform on top of Spark, including but not limited to logi… ☆170 · Nov 17, 2018 · Updated 7 years ago
- Bridge which consumes MQTT messages and republishes them on Kafka on the same topic ☆17 · May 9, 2015 · Updated 10 years ago
- An integration framework that allows you to run and manage CrateDB via Apache Mesos. ☆23 · Jan 30, 2019 · Updated 7 years ago
- The main repository has moved to http://github.com/fog/fog ☆26 · Nov 2, 2011 · Updated 14 years ago
- Kaggle competition ☆23 · Jul 15, 2015 · Updated 10 years ago
- ☆243 · Jun 14, 2018 · Updated 7 years ago
- An Apache Spark-shell backend for IPython ☆105 · Jul 2, 2021 · Updated 4 years ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster. ☆47 · Aug 1, 2016 · Updated 9 years ago
- ☆23 · Jun 11, 2015 · Updated 10 years ago
- Evaluation of API and performance of different actor libraries ☆134 · Jul 12, 2017 · Updated 8 years ago
- Experiments in Streaming ☆60 · Aug 27, 2016 · Updated 9 years ago
- Edit Open Data Contract Standard in Excel ☆35 · Dec 1, 2025 · Updated 3 months ago
- Play example app to show how to integrate Skinny ORM ☆27 · Apr 23, 2019 · Updated 6 years ago