☆20Dec 19, 2023Updated 2 years ago
Alternatives and similar repositories for data-analytics-minio-spark
Users that are interested in data-analytics-minio-spark are comparing it to the libraries listed below
Sorting:
- ☆14Feb 15, 2025Updated last year
- Magic to help Spark pipelines upgrade☆34Sep 29, 2024Updated last year
- Beyond Vibe Coding. Code, Planning, Documentation and Product Management agents.☆70Feb 20, 2026Updated 2 weeks ago
- Daily updated fake data for DBT learning and projects☆35Jan 7, 2024Updated 2 years ago
- An awesome Analytics Engineering repository to learn and apply for real world problems.☆42Sep 19, 2023Updated 2 years ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆101Jun 7, 2024Updated last year
- Jinja cheatsheet for dbt development☆41Jul 25, 2022Updated 3 years ago
- This project sets up a real-time data pipeline utilizing Change Data Capture (CDC) to stream changes from a PostgreSQL database to a Clic…☆12May 9, 2024Updated last year
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆13May 24, 2024Updated last year
- DBT and clickhouse test project with dagster☆12Aug 29, 2023Updated 2 years ago
- Data Engineering Bootcamp 2021☆13Aug 8, 2023Updated 2 years ago
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with …☆15Dec 27, 2023Updated 2 years ago
- A Delta Lake reader for Dask☆53Jul 29, 2025Updated 7 months ago
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase☆11Jul 13, 2023Updated 2 years ago
- Building a poor man's data lake: Exploring the Power of Polars and Delta Lake☆11Dec 6, 2025Updated 3 months ago
- Trying out the Dataframe Polars library with Delta Lake ... feat Python.☆12Jan 29, 2025Updated last year
- ☆16Jul 25, 2025Updated 7 months ago
- Source code for the module "Advanced Statistics" 📊☆10Feb 25, 2019Updated 7 years ago
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago
- A collaborative, real-time feature model editor☆10Nov 27, 2023Updated 2 years ago
- ☆14Feb 23, 2021Updated 5 years ago
- node js http server☆10Jan 26, 2018Updated 8 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.☆13Oct 15, 2020Updated 5 years ago
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- Plugin for Intake to read from SQL servers☆15May 29, 2023Updated 2 years ago
- dbt-databend adapter plugin☆10May 30, 2024Updated last year
- Sample project to demonstrate data engineering best practices☆204Feb 24, 2024Updated 2 years ago
- Variable Selection Network with PyTorch☆11May 29, 2024Updated last year
- Powershell Scripts for Power BI☆13Sep 20, 2023Updated 2 years ago
- dlt-dagster-demo☆13Nov 6, 2023Updated 2 years ago
- Auto-mirror of scoopinstaller/scoop-main bucket☆12Updated this week
- ☆15Dec 11, 2023Updated 2 years ago
- Events about the open source data stack☆13Apr 16, 2022Updated 3 years ago
- Eclipse MicroProfile based Java Microservice running in Payara Micro☆10Nov 4, 2019Updated 6 years ago
- Command line client for the Fugue API☆14Mar 7, 2023Updated 3 years ago
- a stand-alone HTTPie windows binary☆13Nov 26, 2020Updated 5 years ago
- A fast development template for Admin-dashboard based on Ext JS Classic toolkit☆10Jun 29, 2018Updated 7 years ago
- granadoespadav32 private server setup☆17Jan 24, 2024Updated 2 years ago
- Software Template for Openshift AI☆15Jan 21, 2026Updated last month