Python data repo, jupyter notebook, python scripts and data.
☆553Dec 10, 2024Updated last year
Alternatives and similar repositories for pythondataanalysis
Users that are interested in pythondataanalysis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Jan 16, 2025Updated last year
- build dw with dbt☆55Oct 24, 2024Updated last year
- An end-to-end data pipeline for building Data Lake and supporting report using Apache Spark.☆16Jan 31, 2023Updated 3 years ago
- ☆11Oct 8, 2021Updated 4 years ago
- This project demonstrates how to build and automate an ETL pipeline written in Python and schedule it using open source Apache Airflow or…☆23Aug 21, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Distributed Data Systems with Azure Databricks, published by Packt☆12Jan 18, 2023Updated 3 years ago
- trino + hive + minio with postgres in docker compose☆27Aug 18, 2023Updated 2 years ago
- ☆16Mar 9, 2026Updated 2 months ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆20Apr 25, 2024Updated 2 years ago
- ☆26Sep 28, 2023Updated 2 years ago
- ☆32Oct 4, 2024Updated last year
- ☆23Feb 5, 2024Updated 2 years ago
- ☆16Mar 12, 2025Updated last year
- Airflow Tutorials☆25Feb 28, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆146Jul 27, 2023Updated 2 years ago
- Web scraping and descriptive and predictive modelling of Bogor house pricing☆11Mar 29, 2025Updated last year
- Command line tool for interacting with Qlik Sense Enterprise servers☆16Updated this week
- Acquiring and processing information on world's largest banks☆20Apr 19, 2026Updated last month
- Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Jo…☆41,401May 3, 2026Updated 3 weeks ago
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆24Apr 2, 2022Updated 4 years ago
- ☆16Jan 8, 2023Updated 3 years ago
- Data-aware orchestration with dagster, dbt, and airbyte☆31Jan 20, 2023Updated 3 years ago
- ☆14Sep 22, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ETL pipeline using pyspark (Spark - Python)☆117Apr 4, 2020Updated 6 years ago
- ☆10Aug 6, 2024Updated last year
- ☆14May 14, 2024Updated 2 years ago
- I saw this [Blog Post](https://www.morling.dev/blog/one-billion-row-challenge/) on a Billion Row challenge for Java so naturally I tried …☆14Jan 10, 2024Updated 2 years ago
- Код и данные с ODS Introspect Hackathon'а который проходил в кафе "Райский Пирожок", 19-21 Мая 2017.☆11Aug 10, 2022Updated 3 years ago
- ☆21Mar 26, 2023Updated 3 years ago
- ☆146Jan 31, 2023Updated 3 years ago
- Lightweight Python wrapper around the DuckDB extension, httpserver (extension developed by @quackscience)☆17Sep 24, 2025Updated 8 months ago
- An LLM-powered self-studying app using retrieval-augmented generation prompting | Streamlit LLM Hackathon 2023☆17Oct 6, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- End to end data engineering project☆59Oct 27, 2022Updated 3 years ago
- ☆67Sep 24, 2025Updated 8 months ago
- Building a Data Pipeline with an Open Source Stack☆59Jun 27, 2025Updated 11 months ago
- ☆139Mar 16, 2026Updated 2 months ago
- ☆19Nov 27, 2023Updated 2 years ago
- Delta Lake examples☆238Oct 8, 2024Updated last year
- ☆17Dec 9, 2022Updated 3 years ago