arjones / bigdata-workshop-es
Workshop Big Data en Español
☆22Updated last year
Related projects ⓘ
Alternatives and complementary repositories for bigdata-workshop-es
- Notebooks compartidos en las sesiones del meetup apachesparkbogota☆12Updated 5 months ago
- Educational notes,Hands on problems w/ solutions for hadoop ecosystem☆86Updated 5 years ago
- The source code for the book Modern Data Engineering with Apache Spark☆33Updated 2 years ago
- Real-world Spark pipelines examples☆83Updated 6 years ago
- Spark Streaming HBase Example☆22Updated 8 years ago
- Repository used for Spark Trainings☆53Updated last year
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆89Updated 3 years ago
- ☆26Updated 4 years ago
- The official repository for the Rock the JVM Spark Optimization with Scala course☆55Updated 11 months ago
- ☆37Updated 8 years ago
- ETL pipeline using pyspark (Spark - Python)☆108Updated 4 years ago
- Crash course in Scala☆23Updated 4 years ago
- The iterative broadcast join example code.☆69Updated 7 years ago
- Curso de análisis de textos con técnicas de aprendizaje automático☆16Updated 5 years ago
- Batch Processing , orchestration using Apache Airflow and Google Workflows, spark structured Streaming and a lot more☆19Updated 2 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- Airflow training for the crunch conf☆105Updated 6 years ago
- Code snippets used in demos recorded for the blog.☆29Updated last month
- Magic to help Spark pipelines upgrade☆34Updated last month
- Dockerizing an Apache Spark Standalone Cluster☆43Updated 2 years ago
- Repositorio utilizado para el Curso de Apache Spark en Platzi☆19Updated 3 years ago
- Demonstration of using Apache Spark to build robust ETL pipelines while taking advantage of open source, general purpose cluster computin…☆24Updated last year
- type-class based data cleansing library for Apache Spark SQL☆79Updated 5 years ago
- My Study guide used to pass the CRT020 Spark Certification exam☆31Updated 4 years ago
- These are some code examples☆55Updated 4 years ago
- Atlas custom type definitions☆16Updated 3 years ago
- ☆111Updated 4 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging…☆74Updated 5 years ago