sarthak-sarbahi / data-analytics-minio-sparkView external linksLinks
☆19Dec 19, 2023Updated 2 years ago
Alternatives and similar repositories for data-analytics-minio-spark
Users that are interested in data-analytics-minio-spark are comparing it to the libraries listed below
Sorting:
- ☆13Feb 15, 2025Updated last year
- Beyond Vibe Coding. Code, Planning, Documentation and Product Management agents.☆70Jun 16, 2025Updated 8 months ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆98Jun 7, 2024Updated last year
- Daily updated fake data for DBT learning and projects☆35Jan 7, 2024Updated 2 years ago
- Jinja cheatsheet for dbt development☆42Jul 25, 2022Updated 3 years ago
- Data Engineering Bootcamp 2021☆13Aug 8, 2023Updated 2 years ago
- Ebook for Data Scientist, Machine Learning, Deep Learning☆11Mar 16, 2021Updated 4 years ago
- DBT and clickhouse test project with dagster☆12Aug 29, 2023Updated 2 years ago
- This project sets up a real-time data pipeline utilizing Change Data Capture (CDC) to stream changes from a PostgreSQL database to a Clic…☆12May 9, 2024Updated last year
- ☆13Sep 15, 2024Updated last year
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with …☆15Dec 27, 2023Updated 2 years ago
- A Delta Lake reader for Dask☆53Jul 29, 2025Updated 6 months ago
- ☆11Jul 30, 2017Updated 8 years ago
- dbt-databend adapter plugin☆10May 30, 2024Updated last year
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 2 months ago
- Plugin for Intake to read from SQL servers☆15May 29, 2023Updated 2 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.☆13Oct 15, 2020Updated 5 years ago
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- Trying out the Dataframe Polars library with Delta Lake ... feat Python.☆12Jan 29, 2025Updated last year
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago
- Source code for the module "Advanced Statistics" 📊☆10Feb 25, 2019Updated 6 years ago
- ☆14Feb 23, 2021Updated 4 years ago
- Limit long text output for a single JupyterLab mime render.☆13Jul 30, 2025Updated 6 months ago
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase☆11Jul 13, 2023Updated 2 years ago
- node js http server☆10Jan 26, 2018Updated 8 years ago
- Sample project to demonstrate data engineering best practices☆202Feb 24, 2024Updated last year
- High-Resolution Image Synthesis with Latent Diffusion Models☆10Mar 3, 2023Updated 2 years ago
- Events about the open source data stack☆13Apr 16, 2022Updated 3 years ago
- Command line client for the Fugue API☆14Mar 7, 2023Updated 2 years ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- a stand-alone HTTPie windows binary☆13Nov 26, 2020Updated 5 years ago
- Powershell Scripts for Power BI☆13Sep 20, 2023Updated 2 years ago
- dbt package for EDU's Ed-Fi data warehouse☆17Feb 3, 2026Updated last week
- dlt-dagster-demo☆13Nov 6, 2023Updated 2 years ago
- This project explores using machine learning methods for detection of Parkinson's disease using an individual's speech.☆15Nov 18, 2019Updated 6 years ago
- ☆47Jan 8, 2026Updated last month
- Source Code for 'Beginning Apache Spark 3' by Hien Luu☆13Oct 14, 2021Updated 4 years ago
- ☆15Dec 11, 2023Updated 2 years ago
- ☆13Oct 4, 2023Updated 2 years ago