vmware-archive / PDLToolsLinks
PDL Tools is a library of reusable tools used and developed by the Pivotal Data Science and Data Engineering teams.
☆17Updated 3 years ago
Alternatives and similar repositories for PDLTools
Users that are interested in PDLTools are comparing it to the libraries listed below
Sorting:
- Tools to deploy Hadoop on EMC Isilon☆17Updated 9 years ago
- End-to-end data science example running on Cloud Foundry☆19Updated 9 years ago
- ODPi specifications, developed by ODPi Runtime and ODPi Operations projects. Currently in Emeritus status☆35Updated 6 years ago
- ☆146Updated 9 years ago
- Gallery of Apache Zeppelin notebooks☆216Updated 6 years ago
- Utilities to help HBase as a service in HDInsight Azure☆14Updated last year
- Single view demo☆14Updated 9 years ago
- A simple introduction to using spark ml pipelines☆26Updated 7 years ago
- HopsWorks - Hadoop for Humans☆117Updated 6 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆67Updated 9 years ago
- Visualize streaming machine learning in Spark☆177Updated 8 years ago
- Generic spark module for scanning, joining and mutating HBase tables to and from RDDs.☆15Updated 10 years ago
- Use Vagrant and Ambari Blueprint API to install PivotalHD 3.0 (or Hortonworks HDP2.x) Hadoop cluster with HAWQ 1.3 (SQL on Hadoop) and Sp…☆23Updated 9 years ago
- An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parse…☆90Updated 9 years ago
- ☆107Updated 2 years ago
- A visual ETL development and debugging tool for big data☆154Updated 2 years ago
- [DEPRECATED] Script used to manage Hadoop and Spark instances on Google Compute Engine☆109Updated 5 years ago
- Materials for various Hadoop & Nifi related workshops☆51Updated 6 years ago
- Code to index Hive tables to Solr and Solr indexes to Hive☆48Updated 6 years ago
- notebooks for nlp-on-spark☆13Updated 8 years ago
- Workshops on how to setup security on Hadoop using HDP sandboxes☆100Updated 7 years ago
- Code for Tutorial on designing clickstream analytics application using Hadoop☆54Updated 10 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers.☆73Updated 8 years ago
- Ansible playbooks for Apache Spark on kube☆27Updated 8 years ago
- Web based interactive computing environment for H2O☆142Updated 9 months ago
- ☆110Updated 8 years ago
- Implementations of the Portable Format for Analytics (PFA)☆127Updated 2 years ago
- Simplify getting Zeppelin up and running☆56Updated 9 years ago
- Ambari service for Apache Zeppelin notebook☆71Updated 7 years ago
- Microsoft Azure Data Lake Store Filesystem Library for Python☆70Updated 2 months ago