cloudera / cml-training
Example Python and R code for Cloudera Machine Learning (CML) training
☆14Updated 4 years ago
Alternatives and similar repositories for cml-training:
Users that are interested in cml-training are comparing it to the libraries listed below
- ☆16Updated last year
- Hadoop/Hive/Spark container to perform CI tests☆11Updated 4 years ago
- ☆28Updated last year
- A Flink applcation that demonstrates reading and writing to/from Apache Kafka with Apache Flink☆20Updated last year
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- Intended for internal use: deploys all infrastructure required for Astronomer to run on GCP☆10Updated 7 months ago
- Tools to deploy Hadoop on EMC Isilon☆18Updated 8 years ago
- #DataPipeLine #ETL - Created is a Facebook data extraction utility to extract the publicly available data on Facebook. Used Facebook Grap…☆14Updated 6 years ago
- ☆12Updated 5 years ago
- ☆25Updated 4 years ago
- Build an scikit-learn model to predict churn using customer telco data.☆15Updated 3 months ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 5 years ago
- Pipeline library for StreamSets Data Collector and Transformer☆33Updated 2 years ago
- Sample programs for MapR Streams compatible with Apache Kafka 0.9 API☆15Updated last year
- cloudera.cloud - an Ansible collection for Cloudera Data Platform (CDP) for Public and Private Cloud☆20Updated 3 weeks ago
- Edge2AI Workshop☆69Updated 2 months ago
- Kirk's Zeppelin Notebooks☆12Updated 6 years ago
- NiFi Processor for Apache Pulsar☆10Updated 4 months ago
- ☆27Updated 2 months ago
- An Ansible collection for lifecycle and management of Cloudera CDP Private Cloud resources on bare metal, IaaS, and PaaS.☆33Updated last week
- ☆16Updated 4 years ago
- ☆14Updated last month
- Code for the deployment of Hadoop clusters, written in Bourne or Bourne Again shell.☆34Updated last year
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated last year
- Udacity Data Engineering Nano Degree Project, Data Modeling for fact and dimension tables, and ETL pipeline that transfers data from file…☆9Updated 4 years ago
- Machine Learning Processors for NiFi☆10Updated 7 years ago
- A complete custom processor project, for your reference.☆18Updated 9 years ago
- Single view demo☆14Updated 9 years ago
- HDF masterclass materials☆28Updated 8 years ago
- Mastering Spark for Data Science, published by Packt☆47Updated 2 years ago