n-surkov / PySparkPipelineLinks
Module for pipelines concept in PySpark
☆16Updated last year
Alternatives and similar repositories for PySparkPipeline
Users that are interested in PySparkPipeline are comparing it to the libraries listed below
Sorting:
- ☆1Updated 2 years ago
- Learning resources for Airflow Tutorial article.☆55Updated 5 years ago
- ☆12Updated 4 years ago
- Data Engineer RoadMap☆36Updated 3 years ago
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.☆58Updated last year
- Open episode of the data engineering practice course☆26Updated last year
- Курс про Apache Airflow 2.0☆35Updated last year
- Docker Compose with Almond.sh core for Jupyter☆19Updated 11 months ago
- The simple ETL with docker container☆58Updated 2 months ago
- Data Engineering misc☆14Updated 3 years ago
- ☆13Updated last year
- ☆29Updated 3 years ago
- ☆48Updated 4 years ago
- ☆146Updated 2 months ago
- Roadmap для Data Engineer. Цель роадмапа – устроиться тебе на работу!☆416Updated this week
- Practice course on Big Data☆16Updated last year
- ☆182Updated 3 years ago
- Distributed run of dbt models using Airflow☆164Updated 2 months ago
- ☆82Updated last year
- Analytics Engineer Course☆18Updated 2 years ago
- Free Data Science course 4everyone☆160Updated 2 years ago
- ☆9Updated 2 years ago
- Collection of Data Science PET Projects (Сборник PET-проектов Data Science)☆96Updated last year
- ☆12Updated 9 months ago
- ☆16Updated 5 months ago
- Подборка ресурсов открытых данных, ориентированная на использование в странах СНГ, или если вы делаете продукт и исследование про страны …☆73Updated 3 years ago
- Этот репозиторий создан для хранения конспектов по курсам stepik.org☆95Updated 6 years ago
- Репозиторий курса "Modern Storages and Data Warehousing", ФТиАД, НИУ ВШЭ, 2023☆8Updated last year
- Home assignments for data science positions☆632Updated last year
- Spark Cluster with 4 executors☆12Updated last week