vmware-archive / PDLTools
PDL Tools is a library of reusable tools used and developed by the Pivotal Data Science and Data Engineering teams.
☆17 · Updated 3 years ago
Alternatives and similar repositories for PDLTools
Users who are interested in PDLTools are comparing it to the libraries listed below.
- Single view demo ☆14 · Updated 9 years ago
- Utilities to help HBase as a service in HDInsight Azure ☆14 · Updated 2 years ago
- Tools to deploy Hadoop on EMC Isilon ☆17 · Updated 9 years ago
- ODPi specifications, developed by ODPi Runtime and ODPi Operations projects. Currently in Emeritus status ☆35 · Updated 6 years ago
- HDP Data Science/Machine Learning demo ☆37 · Updated 10 years ago
- End-to-end data science example running on Cloud Foundry ☆19 · Updated 9 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆20 · Updated 9 years ago
- Materials for various Hadoop & Nifi related workshops ☆51 · Updated 6 years ago
- A simple introduction to using spark ml pipelines ☆26 · Updated 7 years ago
- Gallery of Apache Zeppelin notebooks ☆216 · Updated 6 years ago
- Code to index Hive tables to Solr and Solr indexes to Hive ☆47 · Updated 6 years ago
- ☆41 · Updated 8 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S… ☆67 · Updated 9 years ago
- Docker image for apache zeppelin ☆38 · Updated 8 years ago
- Generic spark module for scanning, joining and mutating HBase tables to and from RDDs. ☆15 · Updated 10 years ago
- notebooks for nlp-on-spark ☆13 · Updated 8 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆74 · Updated 2 years ago
- Apache Toree quickstart tutorial ☆29 · Updated 9 years ago
- Simple Spark example of generating table stats for use of data quality checks ☆28 · Updated 8 years ago
- Ambari service for Apache Zeppelin notebook ☆71 · Updated 8 years ago
- Coding exercises for Apache Spark ☆104 · Updated 10 years ago
- Use Vagrant and Ambari Blueprint API to install PivotalHD 3.0 (or Hortonworks HDP2.x) Hadoop cluster with HAWQ 1.3 (SQL on Hadoop) and Sp… ☆23 · Updated 9 years ago
- ☆44 · Updated 7 years ago
- Visualize streaming machine learning in Spark ☆177 · Updated 8 years ago
- This repository is to help with the Partner Demonstration of the Apache Atlas project. ☆30 · Updated 10 years ago
- ☆15 · Updated 7 years ago
- HopsWorks - Hadoop for Humans ☆117 · Updated 6 years ago
- Workshop for Spark and Databricks ☆54 · Updated 5 years ago
- ☆110 · Updated 8 years ago
- Demonstration code for MLeap, both Jupyter notebooks and projects ☆24 · Updated 6 years ago