skoonData / docker-composeLinks
☆12Updated 4 years ago
Alternatives and similar repositories for docker-compose
Users that are interested in docker-compose are comparing it to the libraries listed below
Sorting:
- ☆26Updated last year
- Open episode of the data engineering practice course☆29Updated last year
- The simple ETL with docker container☆59Updated 4 months ago
- ☆16Updated 8 months ago
- Multi-container environment with Hadoop, Spark and Hive☆224Updated 5 months ago
- ☆12Updated 4 years ago
- ☆14Updated 2 years ago
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆494Updated 2 years ago
- Surfalytics projces on Data Engineering and Analytics☆114Updated last month
- Spark implementation of Slowly Changing Dimension type 2☆11Updated 6 years ago
- Dockerizing an Apache Spark Standalone Cluster☆43Updated 3 years ago
- dbt module for myBI connect☆13Updated 2 years ago
- Docker with Airflow and Spark standalone cluster☆260Updated 2 years ago
- Distributed run of dbt models using Airflow☆166Updated last week
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆48Updated last year
- Module for pipelines concept in PySpark☆16Updated last year
- ☆21Updated 7 months ago
- Building a Modern Data Lake with Minio, Spark, Airflow via Docker.☆21Updated last year
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow☆34Updated 5 years ago
- Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines☆134Updated 2 years ago
- For Udemy students: the official repository of Rock the JVM's Spark Streaming course☆26Updated 2 years ago
- Apache Spark for data engineers☆56Updated 3 years ago
- Docker Compose with Almond.sh core for Jupyter☆18Updated last year
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.☆57Updated last year
- Курс про Apache Airflow 2.0☆36Updated last month
- ☆21Updated 2 years ago
- Data Engineer RoadMap☆35Updated 3 years ago
- Toolkit for Agile-driven data modeling and data loading using highly Normalized hybrid Model☆23Updated 9 months ago
- Data Forge — a modern data stack playground to practice flows and best practices, not just tools. Spark, Trino, Kafka, Iceberg, ClickHous…☆143Updated last week
- CSD for Apache Airflow☆20Updated 6 years ago