n-surkov / PySparkPipelineLinks
Module for pipelines concept in PySpark
☆16Updated last year
Alternatives and similar repositories for PySparkPipeline
Users that are interested in PySparkPipeline are comparing it to the libraries listed below
Sorting:
- Learning resources for Airflow Tutorial article.☆56Updated 5 years ago
- ☆12Updated 4 years ago
- Data Engineer RoadMap☆35Updated 3 years ago
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.☆57Updated last year
- Docker Compose with Almond.sh core for Jupyter☆18Updated last year
- The simple ETL with docker container☆59Updated 4 months ago
- Open episode of the data engineering practice course☆29Updated last year
- Roadmap для Data Engineer. Цель роадмапа – устроиться тебе на работу!☆462Updated last week
- Курс про Apache Airflow 2.0☆36Updated last month
- ☆14Updated last year
- ☆29Updated 3 years ago
- ☆83Updated last year
- Distributed run of dbt models using Airflow☆167Updated last week
- Data Engineering misc☆14Updated 4 years ago
- Practice course on Big Data☆17Updated last year
- ☆149Updated 4 months ago
- ☆47Updated 4 years ago
- ☆185Updated 3 years ago
- ☆13Updated 8 months ago
- Collection of Data Science PET Projects (Сборник PET-проектов Data Science)☆97Updated 2 months ago
- Все, о чем меня когда-либо спрашивали на собеседованиях, и другие полезные знания в кратком формате☆177Updated last year
- ☆16Updated 8 months ago
- python курс☆39Updated last week
- Free Data Science course 4everyone☆161Updated 2 years ago
- Курс по матстату для онлайна :)☆404Updated last year
- ☆75Updated last year
- ☆90Updated 3 years ago
- Проекты по программе "Аналитик данных" от Яндекс.Практикум☆107Updated 5 years ago
- Data Forge — a modern data stack playground to practice flows and best practices, not just tools. Spark, Trino, Kafka, Iceberg, ClickHous…☆147Updated 2 weeks ago
- Analytics Engineer Course☆19Updated 2 years ago