Yet Another (Spark) ETL Framework
☆21 · Oct 21, 2023 · Updated 2 years ago
Alternatives and similar repositories for yetl
Users that are interested in yetl are comparing it to the libraries listed below.
- Delta Lake helper methods in PySpark ☆328 · Jan 19, 2026 · Updated 3 months ago
- Delta Lake Website ☆26 · Updated this week
- A DataOps framework for building a lakehouse. ☆57 · Updated this week
- Traditionally, engineers were needed to implement business logic via data pipelines before business users can start using it. Using this … ☆12 · Apr 23, 2026 · Updated last week
- Manage Unity Catalog tables with Pydantic Models ☆10 · Mar 5, 2025 · Updated last year
- Open Log Analytics queries and samples on querying different Azure resources and services. Includes sample Power BI reports ☆12 · Mar 31, 2022 · Updated 4 years ago
- Python Package for ducklake ☆20 · Jun 5, 2025 · Updated 11 months ago
- Spring Data Module for YugabyteDB. ☆18 · Aug 30, 2021 · Updated 4 years ago
- A unit test framework for Databricks notebooks ☆12 · Dec 8, 2020 · Updated 5 years ago
- Visualize linear programming at https://lpviz.net ☆36 · Apr 27, 2026 · Updated last week
- Python code that will collapse structured columns separating out the attributes into new columns ☆10 · Mar 15, 2022 · Updated 4 years ago
- 🚗 Downloads a Google Drive folder that you can query with gatsby-source-filesystem. ☆12 · Mar 2, 2023 · Updated 3 years ago
- Code for the paper: Kernel Distributionally Robust Optimization ☆13 · Feb 21, 2021 · Updated 5 years ago
- Data Lineage for Spark components and PowerBI/AAS showing up in Azure Purview ☆19 · Jun 11, 2024 · Updated last year
- Project repository of Apache Airflow, deployed on Docker in Amazon EC2 via GitLab. ☆15 · Sep 3, 2021 · Updated 4 years ago
- Robust Bond Portfolio Construction via Convex-Concave Saddle Point Optimization ☆14 · May 13, 2024 · Updated last year
- Data sources for Elastic Map Service ☆23 · Updated this week
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin… ☆78 · Sep 2, 2023 · Updated 2 years ago
- 🧱 A collection of supplementary utilities and helper notebooks to perform admin tasks on Databricks ☆57 · Jul 4, 2025 · Updated 10 months ago
- TPC-H_SF10 ☆53 · Jan 20, 2025 · Updated last year
- Node.js and MySQL app ☆17 · Apr 9, 2016 · Updated 10 years ago
- Gatsby transformer plugin for jupyter notebooks ☆10 · Jan 7, 2019 · Updated 7 years ago
- Jupyter Cell / Line Magics for DuckDB ☆59 · Apr 10, 2026 · Updated 3 weeks ago
- Python packages for Support Vector Regression with Linear Constraints ☆10 · Jul 9, 2020 · Updated 5 years ago
- Walmart version of the Linear Road streaming benchmark. ☆22 · Mar 30, 2021 · Updated 5 years ago
- A python SPark ETL libRary (SPETLR) for Databricks. https://discord.gg/p9bzqGybVW ☆24 · Mar 3, 2026 · Updated 2 months ago
- Repository for code samples from the book Mastering Azure Analytics ☆25 · Apr 10, 2017 · Updated 9 years ago
- Passbolt CE installation scripts ☆19 · Mar 16, 2021 · Updated 5 years ago
- DuckDB Copilot Extension ☆10 · Jan 12, 2026 · Updated 3 months ago
- This is a `Rust` based package to help with the management of complex medicine (pill) management cycles. ☆27 · Dec 3, 2023 · Updated 2 years ago
- A connector to ingest Azure Databricks lineage into Microsoft Purview ☆93 · Apr 12, 2024 · Updated 2 years ago
- fully custom stepper ☆11 · Mar 28, 2022 · Updated 4 years ago
- DuckDB CronJob Extension ☆48 · Mar 29, 2026 · Updated last month
- Classify images of different kitchenware items ☆11 · Apr 17, 2023 · Updated 3 years ago
- the codes and some preliminary progress in the work of robust stochastic portfolio optimization ☆11 · Oct 15, 2020 · Updated 5 years ago
- A password sharing plugin for KeePass. ☆17 · Aug 31, 2019 · Updated 6 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o… ☆53 · Jun 17, 2025 · Updated 10 months ago
- Codes for the paper "Residuals-based Distributionally Robust Optimization with Covariate Information" ☆10 · Aug 13, 2022 · Updated 3 years ago
- This repo demonstrates an Apache Arrow Flight server implementation in Kubernetes. ☆12 · Oct 25, 2024 · Updated last year