n-surkov / PySparkPipelineLinks
Module for pipelines concept in PySpark
☆16Updated last year
Alternatives and similar repositories for PySparkPipeline
Users that are interested in PySparkPipeline are comparing it to the libraries listed below
Sorting:
- ☆1Updated 2 years ago
- Learning resources for Airflow Tutorial article.☆55Updated 4 years ago
- ☆11Updated 4 years ago
- Data Engineer RoadMap☆36Updated 3 years ago
- ☆80Updated last year
- ☆29Updated 3 years ago
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.☆58Updated last year
- Курс про Apache Airflow 2.0☆35Updated 11 months ago
- Docker Compose with Almond.sh core for Jupyter☆19Updated 9 months ago
- The simple ETL with docker container☆48Updated this week
- Open episode of the data engineering practice course☆26Updated 11 months ago
- Practice course on Big Data☆16Updated last year
- ☆88Updated 3 years ago
- ☆48Updated 3 years ago
- ☆62Updated 7 months ago
- ☆141Updated 3 years ago
- 100 упражнений по numpy версия на русском☆165Updated last year
- Репозиторий курса "Modern Storages and Data Warehousing", ФТиАД, НИУ ВШЭ, 2023☆8Updated last year
- Collection of Data Science PET Projects (Сборник PET-проектов Data Science)☆90Updated 10 months ago
- 🐳 Проектная деятельность. Здесь хранятся лекции, практические задания и проекты с karpov_courses. Ссылка: https://karpov.courses/☆171Updated 2 years ago
- ☆11Updated last year
- python курс☆38Updated 2 weeks ago
- Roadmap для Data Engineer. Цель роадмапа – устроиться тебе на работу!☆305Updated last week
- ☆8Updated 3 years ago
- ☆14Updated 3 months ago
- Distributed run of dbt models using Airflow☆162Updated this week
- Fast data quality framework for modern data infrastructure☆28Updated 4 months ago
- Course on how to write clean, maintainable and scalable code on Python☆37Updated 3 months ago
- Spark Cluster with 4 executors☆11Updated 5 months ago
- ☆177Updated 3 years ago