allenday / R-Storm
☆31Updated 10 years ago
Alternatives and similar repositories for R-Storm
Users that are interested in R-Storm are comparing it to the libraries listed below
- spark backend for dplyr☆48Updated 9 years ago
- Mirror of Apache Zeppelin (Incubating)☆45Updated 9 years ago
- A package that allows R developers to use Hadoop HDFS☆64Updated 7 years ago
- Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine☆168Updated 4 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR☆34Updated 9 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago
- PMML evaluator library for the Apache Hive data warehouse software (legacy codebase)☆13Updated 10 years ago
- Apache Zeppelin on Kubernetes.☆28Updated 6 years ago
- Templates for projects based on top of H2O.☆38Updated 2 months ago
- MLeap allows for easily putting Spark ML pipelines into production☆78Updated 8 years ago
- open source version of the Bonsai library☆26Updated 9 years ago
- A real time streaming implementation of markov chain based fraud detection☆23Updated 10 years ago
- Training materials for Strata, AMP Camp, etc☆149Updated 9 years ago
- ☆38Updated 10 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- Apache Spark OpenCPU Executor (ROSE)☆26Updated 6 years ago
- Complete Pipeline Training at Big Data Scala By the Bay☆71Updated 9 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆147Updated 9 years ago
- Ambari Service definition for a Jupyter (IPython3) Notebook service☆42Updated 8 years ago
- Cascading on Apache Flink®☆54Updated last year
- Code for Packt Publishing's Scala Data Analysis Cookbook.☆49Updated 9 years ago
- training material☆47Updated 7 months ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- A package that allows R developers to use Hadoop MapReduce☆158Updated 4 years ago
- DEPRECATED Build, manage and deploy H2O's high-speed machine learning models.☆61Updated 6 years ago
- Pig on Apache Spark☆83Updated 10 years ago
- An R-like GLM package for Apache Spark☆10Updated 9 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- Kite SDK Examples☆99Updated 4 years ago