hnawaz007 / pythondataanalysisView external linksLinks
Python data repo, jupyter notebook, python scripts and data.
☆550Dec 10, 2024Updated last year
Alternatives and similar repositories for pythondataanalysis
Users that are interested in pythondataanalysis are comparing it to the libraries listed below
Sorting:
- ☆10Mar 31, 2025Updated 10 months ago
- build dw with dbt☆51Oct 24, 2024Updated last year
- ☆16May 29, 2023Updated 2 years ago
- ☆11Oct 8, 2021Updated 4 years ago
- ☆38Jan 27, 2026Updated 2 weeks ago
- ☆26Sep 28, 2023Updated 2 years ago
- Workshop about DVC VSCode Extension☆13Sep 25, 2024Updated last year
- PyRapidML is an open source Python library which not only helps in automating Machine Learning Workflows but also helps in building end t…☆14Aug 7, 2021Updated 4 years ago
- a dbt adapter for Apache Doris☆27Nov 17, 2023Updated 2 years ago
- This project demonstrates how to build and automate an ETL pipeline written in Python and schedule it using open source Apache Airflow or…☆18Aug 21, 2025Updated 5 months ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆18Apr 25, 2024Updated last year
- ☆21Feb 5, 2024Updated 2 years ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆144Jul 27, 2023Updated 2 years ago
- ☆46Jul 6, 2024Updated last year
- ☆31Oct 4, 2024Updated last year
- Data-aware orchestration with dagster, dbt, and airbyte☆31Jan 20, 2023Updated 3 years ago
- Delta Lake examples☆238Oct 8, 2024Updated last year
- ☆21Nov 21, 2023Updated 2 years ago
- All the ressources and guide to practice the Patou Tips☆27Feb 1, 2026Updated 2 weeks ago
- ☆10Aug 6, 2024Updated last year
- Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Jo…☆38,379Updated this week
- Repository for GH public projects☆18Feb 29, 2024Updated last year
- This is an overview of a MLOps architecture that includes both Airflow and MLflow running on separate Docker containers.☆22Oct 18, 2022Updated 3 years ago
- open source data lake☆31Jan 17, 2025Updated last year
- New Generation Opensource Data Stack Demo☆454Feb 6, 2023Updated 3 years ago
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆24Apr 2, 2022Updated 3 years ago
- I saw this [Blog Post](https://www.morling.dev/blog/one-billion-row-challenge/) on a Billion Row challenge for Java so naturally I tried …☆14Jan 10, 2024Updated 2 years ago
- ☆11Nov 26, 2024Updated last year
- ☆10Jun 22, 2022Updated 3 years ago
- Writes the CSV file to Postgres, read table and modify it. Write more tables to Postgres with Airflow.☆38Sep 1, 2023Updated 2 years ago
- This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…☆104Sep 26, 2025Updated 4 months ago
- ☆12Jan 27, 2026Updated 2 weeks ago
- This reference architecture demonstrates the use of AWS Step Functions to orchestrate an Extract Transfer Load (ETL) workflow with AWS La…☆24Jun 16, 2020Updated 5 years ago
- Linkedin Webscraper is a tool for search jobs publications (or other publications) with a keyword. Download data to excel file.☆24Feb 16, 2022Updated 3 years ago
- ☆21Mar 26, 2023Updated 2 years ago
- Essential PySpark for Scalable Data Analytics, published by Packt☆46Jan 30, 2023Updated 3 years ago
- This project provides an end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The spark cluste…☆12Oct 11, 2023Updated 2 years ago
- Acquiring and processing information on world's largest banks☆17Jun 17, 2025Updated 7 months ago
- This repo contains my projects from the Udacity Data Engineering Nano degree☆13Apr 26, 2023Updated 2 years ago