sryza / aasLinks
Code to accompany Advanced Analytics with Spark from O'Reilly Media
☆1,531Updated last year
Alternatives and similar repositories for aas
Users that are interested in aas are comparing it to the libraries listed below
Sorting:
- MapReduce, Spark, Java, and Scala for Data Algorithms Book☆1,082Updated last year
- A library for time series analysis on Apache Spark☆1,195Updated 5 years ago
- Examples for High Performance Spark☆524Updated 3 weeks ago
- Scala examples for learning to use Spark☆445Updated 5 years ago
- A free tutorial for Apache Spark.☆993Updated 5 years ago
- REST job server for Apache Spark☆2,846Updated 5 months ago
- The Internals of Apache Spark☆1,534Updated 5 months ago
- A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.☆674Updated 3 years ago
- Examples for learning spark☆332Updated 10 years ago
- Sparkling Water provides H2O functionality inside Spark cluster☆977Updated last month
- A connector for Spark that allows reading and writing to/from Redis cluster☆945Updated last year
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,007Updated 3 years ago
- Notes talking about the design and implementation of Apache Spark☆5,348Updated last year
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,784Updated 4 years ago
- PySpark + Scikit-learn = Sparkit-learn☆1,153Updated 4 years ago
- Interactive and Reactive Data Science using Scala and Spark.☆3,153Updated 2 years ago
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,861Updated 2 years ago
- A scalable machine learning library on Apache Spark☆796Updated 4 years ago
- Mirror of Apache Toree (Incubating)☆749Updated 3 weeks ago
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.☆551Updated 4 years ago
- The Internals of Spark Structured Streaming☆422Updated last month
- MLeap: Deploy ML Pipelines to Production☆1,528Updated last week
- Jupyter magics and kernels for working with remote Spark clusters☆1,362Updated 3 months ago
- Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks☆1,667Updated last year
- Java library and command-line application for converting Apache Spark ML pipelines to PMML☆269Updated 3 weeks ago
- Scripts used to setup a Spark cluster on EC2☆389Updated 8 years ago
- Base classes to use when writing tests with Spark☆1,545Updated last month
- Stream Data Mining Library for Spark Streaming☆496Updated 2 years ago
- Code base for the Learning PySpark book (in preparation)☆628Updated 6 years ago
- The book's repo☆273Updated 8 years ago