Module for pipelines concept in PySpark
☆16Mar 27, 2024Updated last year
Alternatives and similar repositories for PySparkPipeline
Users that are interested in PySparkPipeline are comparing it to the libraries listed below
Sorting:
- ☆13Mar 22, 2024Updated last year
- ETL processing toolset with SQL-like language and GIS capabilities, built on core Spark. Extensible and modular. REPL included☆16Jan 26, 2026Updated last month
- ☆22Dec 18, 2024Updated last year
- ☆12Jul 27, 2021Updated 4 years ago
- Repository for course by System Design☆37May 16, 2023Updated 2 years ago
- ☆10Jun 29, 2021Updated 4 years ago
- ☆10Dec 29, 2018Updated 7 years ago
- ☆36Sep 3, 2022Updated 3 years ago
- Analyze my weight loss journey☆12Oct 27, 2019Updated 6 years ago
- Materials for LSML 2023 (HSE)☆10Mar 21, 2023Updated 2 years ago
- ☆17Jan 12, 2026Updated last month
- ☆11Aug 14, 2022Updated 3 years ago
- Project is in active development and has been moved to https://repository.datamart.ru/datamarts/prostore.☆17Apr 22, 2022Updated 3 years ago
- The `netcat` container, the Swiss army knife of networking tool, Dockerized !☆10Jul 4, 2018Updated 7 years ago
- ☆13Jan 23, 2023Updated 3 years ago
- Scala-based project to visualize Scala programs in UML class diagrams.☆12Aug 30, 2023Updated 2 years ago
- ☆15May 7, 2025Updated 9 months ago
- A WebdriverIO & Cucumber Boilerplate based on Page Object Model!☆10Jan 26, 2023Updated 3 years ago
- Simple class for parsing left and right column for Yandex Wordstat via Direct API☆13Feb 27, 2022Updated 4 years ago
- Source code and slides for the tictactoe4k talk☆12May 2, 2023Updated 2 years ago
- ☆17Dec 2, 2025Updated 2 months ago
- Simple dashboard (follows MVC pattern) for monitoring temperature, humidity, gas, and sound Arduino sensors via using Kafka (stream-proce…☆11Mar 1, 2023Updated 3 years ago
- ☆10Mar 12, 2021Updated 4 years ago
- Тесты к курсу для профессионалов☆11Jan 16, 2026Updated last month
- Download Dump and Test it☆11Nov 15, 2024Updated last year
- Flask Telegram Bot☆11Dec 8, 2022Updated 3 years ago
- Simple demo using "behave" and "pyspark" libraries to test data transformations in a human-readable way☆10Apr 5, 2019Updated 6 years ago
- 2019 PyOhio talk and code sample on spotify/luigi☆11Aug 14, 2023Updated 2 years ago
- ☆12May 19, 2021Updated 4 years ago
- BigData analytics platform☆18Feb 21, 2026Updated last week
- ☆26Dec 15, 2024Updated last year
- Лекции и материалы по курсам "Математические методы анализа текстов" осеннего семестра 2021 года для студентов кафедры ММП, ВМК МГУ и каф…☆11Jan 11, 2022Updated 4 years ago
- ☆14Mar 11, 2023Updated 2 years ago
- A library to make it easier to load input URLs to start scrapy processes☆14Feb 21, 2021Updated 5 years ago
- ☆13Feb 18, 2022Updated 4 years ago
- AirFlow is a system to programmaticaly author, schedule and monitor data pipelines.☆13Jan 17, 2015Updated 11 years ago
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.☆58Jan 6, 2024Updated 2 years ago
- Learning resources for Airflow Tutorial article.☆56Jul 22, 2020Updated 5 years ago
- ☆17Nov 7, 2024Updated last year