n-surkov / PySparkPipeline
Module for pipelines concept in PySpark
☆15Updated 9 months ago
Alternatives and similar repositories for PySparkPipeline:
Users that are interested in PySparkPipeline are comparing it to the libraries listed below
- ☆0Updated last year
- ☆11Updated 3 years ago
- Курс про Apache Airflow 2.0☆32Updated 6 months ago
- Data Engineer RoadMap☆34Updated 2 years ago
- Open episode of the data engineering practice course☆25Updated 6 months ago
- Learning resources for Airflow Tutorial article.☆55Updated 4 years ago
- Docker Compose with Almond.sh core for Jupyter☆17Updated 4 months ago
- The simple ETL with docker container☆36Updated last month
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.☆57Updated last year
- ☆47Updated 3 years ago
- ☆11Updated last year
- Roadmap для Data Engineer. Цель роадмапа – устроиться тебе на работу!☆93Updated 2 weeks ago
- Data Engineering misc☆14Updated 3 years ago
- Репозиторий курса "Modern Storages and Data Warehousing", ФТиАД, НИУ ВШЭ, 2023☆8Updated last year
- Analytics Engineer Course☆18Updated last year
- ☆134Updated 2 years ago
- ☆29Updated 2 years ago
- 🐳 Проектная деятельность. Здесь хранятся лекции, практические задания и проекты с karpov_courses. Ссылка: https://karpov.courses/☆157Updated 2 years ago
- ☆79Updated 8 months ago
- ☆14Updated last month
- Distributed run of dbt models using Airflow☆143Updated 3 weeks ago
- ☆88Updated 2 years ago
- 100 упражнений по numpy версия на русском☆161Updated 11 months ago
- Подборка ресурсов открытых данных, ориентированная на использование в странах СНГ, или если вы делаете продукт и исследование про страны …☆70Updated 2 years ago
- Курс по матстату для онлайна :)☆357Updated 7 months ago
- ☆43Updated 3 years ago
- Free Data Science course 4everyone☆153Updated 2 years ago
- ☆25Updated 2 years ago
- Course on how to write clean, maintainable and scalable code on Python☆37Updated 3 months ago
- ☆49Updated 3 months ago