datalayer-attic / zeppelin
Apache Zeppelin on Kubernetes.
☆28Updated 5 years ago
Alternatives and similar repositories for zeppelin:
Users that are interested in zeppelin are comparing it to the libraries listed below
- Mirror of Apache Zeppelin (Incubating)☆45Updated 8 years ago
- spark backend for dplyr☆48Updated 9 years ago
- open source version of the Bonsai library☆26Updated 9 years ago
- A package that allows R developers to use Hadoop HDFS☆64Updated 7 years ago
- ☆31Updated 9 years ago
- DEPRECATED Build, manage and deploy H2O's high-speed machine learning models.☆61Updated 5 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 8 years ago
- Data Science box: Spark, Jupyter, R+RStudio, Zeppelin, Python 2 & 3, Java, Scala.☆39Updated 6 years ago
- ☆41Updated 7 years ago
- ☆38Updated 9 years ago
- Apache Spark OpenCPU Executor (ROSE)☆26Updated 6 years ago
- spark and hive backends for dplyr☆8Updated 9 years ago
- Apache Toree quickstart tutorial☆29Updated 8 years ago
- R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks☆120Updated 7 years ago
- Templates for projects based on top of H2O.☆37Updated this week
- training material☆47Updated 4 months ago
- Apache Spark under Docker☆9Updated 8 years ago
- Sparklyr Extensions API☆31Updated 8 years ago
- Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR☆34Updated 8 years ago
- Druid connector for R☆52Updated 8 years ago
- Training materials for Strata, AMP Camp, etc☆150Updated 9 years ago
- VM with complete R (RStudio) environment☆9Updated 9 years ago
- A package that allows R developer to use Hadoop MapReduce☆159Updated 4 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python.☆38Updated 5 years ago
- R dplyr connector for ImpalaDB☆15Updated 8 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 7 years ago
- ☆53Updated 6 years ago
- RHive is an R extension facilitating distributed computing via Apache Hive.☆123Updated 7 years ago
- Deep neural networks on over 50 classification problems from the UC Irvine Machine Learning Repository☆25Updated 9 years ago