rss161030 / ETL-processes-using-Sqoop-Hadoop-Hive-Spark-and-ScalaLinks
I implemented various ETL processes like loading the data using sqoop from mysql to hdfs, transform the data using Spark and Scala, perform analytics using Spark and Scala and loading the data back to HDFS.
☆11Updated 7 years ago
Alternatives and similar repositories for ETL-processes-using-Sqoop-Hadoop-Hive-Spark-and-Scala
Users that are interested in ETL-processes-using-Sqoop-Hadoop-Hive-Spark-and-Scala are comparing it to the libraries listed below
Sorting:
- Spark Examples☆125Updated 3 years ago
- Apache Spark Course Material☆94Updated 2 years ago
- Spark Structured Streaming / Kafka / Cassandra / Elastic☆183Updated 2 years ago
- Self-contained examples of Apache Spark streaming integrated with Apache Kafka.☆199Updated 7 years ago
- Apache Spark™ and Scala Workshops☆263Updated last year
- Examples of Spark 3.0☆46Updated 4 years ago
- Apche Spark Structured Streaming with Kafka using Python(PySpark)☆40Updated 6 years ago
- Educational notes,Hands on problems w/ solutions for hadoop ecosystem☆87Updated 6 years ago
- This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in Scala language☆567Updated last year
- ☆11Updated 6 years ago
- Examples of Spark 2.0☆212Updated 4 years ago
- Spark structured streaming examples with using of version 3.5.1☆26Updated last year
- Databricks - Apache Spark™ - 2X Certified Developer☆265Updated 5 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated last year
- PySpark-ETL☆23Updated 5 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 6 years ago
- Sample processing code using Spark 2.1+ and Scala☆51Updated 5 years ago
- Apache Spark 3 - Structured Streaming Course Material☆46Updated 5 years ago
- Code snippets used in demos recorded for the blog.☆38Updated 2 weeks ago
- The Internals of Spark Structured Streaming☆419Updated 2 years ago
- Getting started with Spark, Spark streaming, Spark SQL and DataFrame.☆47Updated 7 years ago
- The official repository for the Rock the JVM Spark Optimization with Scala course☆58Updated last year
- Apache Spark is a fast, in-memory data processing engine with elegant and expressive development API's to allow data workers to efficient…☆55Updated 2 years ago
- The official repository for the Rock the JVM Spark Essentials with Scala course☆274Updated 6 months ago
- ☆311Updated 6 years ago
- Simple examle for Spark Streaming over Kafka topic☆107Updated 4 years ago
- Guide for databricks spark certification☆58Updated 4 years ago
- Apache Spark and Apache Kafka integration example☆124Updated 7 years ago
- Flowchart for debugging Spark applications☆107Updated 11 months ago
- Spark with Scala example projects☆34Updated 6 years ago