actionml / cluster-setupLinks
Setting up and running the Universal Recommender in clustered mode
☆22Updated 9 years ago
Alternatives and similar repositories for cluster-setup
Users that are interested in cluster-setup are comparing it to the libraries listed below
Sorting:
- PredictiionIO Template for Universal Recommender☆111Updated 8 years ago
- A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB☆93Updated 5 years ago
- This project combines Apache Spark and Elasticsearch to enable mining & prediction for Elasticsearch.☆212Updated 11 years ago
- Gallery of Apache Zeppelin notebooks☆216Updated 6 years ago
- A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR☆120Updated 9 years ago
- DEPRECATED. Zeppelin has moved to Apache. Please make pull request there☆406Updated 8 years ago
- Next-generation web analytics processing with Scala, Spark, and Parquet.☆331Updated 10 years ago
- Docker build for Zeppelin, a web-based Spark notebook☆221Updated 6 years ago
- PySpark Cassandra brings back the fun in working with Cassandra data in PySpark.☆79Updated 8 years ago
- Visualize streaming machine learning in Spark☆177Updated 8 years ago
- ☆110Updated 8 years ago
- Google BigQuery support for Spark, SQL, and DataFrames☆156Updated 6 years ago
- PredictionIO, a machine learning server for developers and ML engineers. Built on Apache Spark, HBase and Spray.☆14Updated 7 years ago
- Run PredictionIO inside Docker☆200Updated 7 years ago
- REST job server for Spark. Note that this is *not* the mainline open source version. For that, go to https://github.com/spark-jobserver…☆346Updated 8 years ago
- ☆76Updated 10 years ago
- ☆39Updated 8 years ago
- Mazerunner extends a Neo4j graph database to run scheduled big data graph compute algorithms at scale with HDFS and Apache Spark.☆382Updated 3 years ago
- Library and tools for advanced feature engineering☆568Updated 5 years ago
- Highly configurable recommender based on PredictionIO and Mahout's Correlated Cross-Occurrence algorithm☆673Updated 6 years ago
- DataPipeline for humans.☆250Updated 3 years ago
- MLeap allows for easily putting Spark ML pipelines into production☆78Updated 9 years ago
- Simplify getting Zeppelin up and running☆56Updated 9 years ago
- Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR☆34Updated 9 years ago
- Hadoop mapreduce job to bulk load data into Cassandra☆75Updated 3 years ago
- REST web service for the true real-time scoring (<1 ms) of Scikit-Learn, R and Apache Spark models☆589Updated 2 months ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Updated 8 years ago
- Mazerunner extends a Neo4j graph database to run scheduled big data graph compute algorithms at scale with HDFS and Apache Spark.☆127Updated 10 years ago
- Scalable machine learning library for Apache Hive/Spark/Pig☆502Updated 9 years ago
- Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines. This re…☆167Updated 7 years ago