khurramturk / CloudAge
Hadoop Scripts
☆14Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for CloudAge
- Prerequisites checker for Cloudera Manager and CDP PVC Base installations☆57Updated last year
- CDP examples and tutorials☆19Updated 11 months ago
- Useful shell scripts for Hadoop/Linux system administrator☆57Updated 6 years ago
- ☆305Updated 5 years ago
- Databricks - Apache Spark™ - 2X Certified Developer☆264Updated 4 years ago
- Apache Spark™ and Scala Workshops☆262Updated 3 months ago
- Kerberos and Hadoop: The Madness beyond the Gate☆277Updated last year
- The Internals of Spark SQL☆454Updated 2 months ago
- Spark Examples☆123Updated 2 years ago
- Qubole Sparklens tool for performance tuning Apache Spark☆568Updated 4 months ago
- Self-contained examples of Apache Spark streaming integrated with Apache Kafka.☆199Updated 6 years ago
- The Internals of Spark Structured Streaming☆415Updated last year
- ☆11Updated 5 years ago
- Spark Structured Streaming / Kafka / Cassandra / Elastic☆184Updated last year
- Edge2AI Workshop☆68Updated 2 weeks ago
- Educational notes,Hands on problems w/ solutions for hadoop ecosystem☆86Updated 5 years ago
- A general purpose framework for automating Cloudera Products☆64Updated last month
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆705Updated 3 months ago
- ETL pipeline using pyspark (Spark - Python)☆108Updated 4 years ago
- ☆73Updated 3 years ago
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.☆552Updated 3 years ago
- A simplified, lightweight ETL Framework based on Apache Spark☆584Updated 9 months ago
- Apache Spark is a fast, in-memory data processing engine with elegant and expressive development API's to allow data workers to efficient…☆55Updated last year
- CSD for Apache Airflow☆20Updated 5 years ago
- Preparatory notes for the Cloudera Spark and Hadoop Certification☆18Updated 5 years ago
- This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in Scala language☆559Updated 7 months ago