n-surkov / PySparkPipelineLinks
Module for pipelines concept in PySpark
☆15Updated last year
Alternatives and similar repositories for PySparkPipeline
Users that are interested in PySparkPipeline are comparing it to the libraries listed below
Sorting:
- ☆12Updated 4 years ago
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.☆57Updated last year
- Data Engineer RoadMap☆35Updated 3 years ago
- Open episode of the data engineering practice course☆30Updated last year
- Курс про Apache Airflow 2.0☆36Updated last month
- Learning resources for Airflow Tutorial article.☆56Updated 5 years ago
- ☆16Updated 7 months ago
- The simple ETL with docker container☆59Updated 4 months ago
- ☆29Updated 3 years ago
- Data Forge — a modern data stack playground to practice flows and best practices, not just tools. Spark, Trino, Kafka, Iceberg, ClickHous…☆72Updated last week
- Distributed run of dbt models using Airflow☆166Updated this week
- Analytics Engineer Course☆18Updated 2 years ago
- Roadmap для Data Engineer. Цель роадмапа – устроиться тебе на работу!☆452Updated this week
- Docker Compose with Almond.sh core for Jupyter☆18Updated last year
- Data Engineering misc☆14Updated 4 years ago
- ☆47Updated 4 years ago
- ☆185Updated 3 years ago
- ☆14Updated last year
- Free Data Science course 4everyone☆161Updated 2 years ago
- Getting Started with Data Enngineering☆1,294Updated 5 months ago
- One ETL tool to rule them all☆84Updated last week
- Practice course on Big Data☆15Updated last year
- ☆13Updated 8 months ago
- ☆75Updated 11 months ago
- ☆148Updated 4 months ago
- Toolkit for Agile-driven data modeling and data loading using highly Normalized hybrid Model☆23Updated 9 months ago
- python курс☆39Updated 3 weeks ago
- Курс по матстату для онлайна :)☆401Updated last year
- ☆83Updated last year
- ☆90Updated 3 years ago