n-surkov / PySparkPipeline
Module implementing the pipeline concept in PySpark
☆16Updated last year
Alternatives and similar repositories for PySparkPipeline
Users who are interested in PySparkPipeline are comparing it to the libraries listed below
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.☆58Updated last year
- ☆12Updated 4 years ago
- Learning resources for Airflow Tutorial article.☆56Updated 5 years ago
- A course on Apache Airflow 2.0☆36Updated 2 months ago
- Open episode of the data engineering practice course☆29Updated last year
- Data Engineer RoadMap☆35Updated 3 years ago
- Docker Compose with Almond.sh core for Jupyter☆18Updated last year
- ☆29Updated 3 years ago
- Roadmap for Data Engineers. The roadmap's goal is to get you hired!☆471Updated 2 weeks ago
- The simple ETL with docker container☆61Updated 5 months ago
- Distributed run of dbt models using Airflow☆167Updated this week
- Free Data Science course 4everyone☆162Updated 2 years ago
- Practice course on Big Data☆17Updated last year
- ☆16Updated 9 months ago
- Data Forge — a modern data stack playground to practice flows and best practices, not just tools. Spark, Trino, Kafka, Iceberg, ClickHous…☆152Updated last month
- ☆47Updated 4 years ago
- ☆14Updated 2 years ago
- ☆185Updated 3 years ago
- ☆151Updated 5 months ago
- Getting Started with Data Engineering☆1,301Updated 6 months ago
- Data Engineering misc☆14Updated 4 years ago
- ☆83Updated last year
- ☆90Updated 3 years ago
- Everything I have ever been asked in interviews, plus other useful knowledge in a concise format☆176Updated last year
- ☆78Updated last year
- Analytics Engineer Course☆19Updated 2 years ago
- Projects from the "Data Analyst" program by Yandex.Practicum☆107Updated 5 years ago
- Python course☆39Updated 2 weeks ago
- An online course on mathematical statistics :)☆408Updated last year
- The complete "Machine Learning and Data Analysis" specialization from MIPT and Yandex on Coursera☆251Updated 5 years ago