Yet Another (Spark) ETL Framework
☆21Oct 21, 2023Updated 2 years ago
Alternatives and similar repositories for yetl
Users that are interested in yetl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Delta Lake helper methods in PySpark☆329Jan 19, 2026Updated 4 months ago
- Delta Lake Website☆25May 22, 2026Updated 3 weeks ago
- Python script to call Azure Retail Prices API and save the retail prices as an excel file☆15Oct 14, 2021Updated 4 years ago
- Python Package for ducklake☆20Jun 5, 2025Updated last year
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆16Oct 14, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Schema Registry Statistics Tool☆24Jun 5, 2026Updated last week
- The Modern Data Stack in a (Smaller) Box☆12Jan 28, 2023Updated 3 years ago
- Map your python dataclasses to pyspark types☆10Feb 11, 2024Updated 2 years ago
- Visualize linear programming at https://lpviz.net☆40May 4, 2026Updated last month
- Delta Lake Documentation☆53Jun 19, 2024Updated last year
- Visual Studio Code Server on Azure Web App for Containers☆10Apr 12, 2019Updated 7 years ago
- ☆15Apr 19, 2018Updated 8 years ago
- Code for the paper: Kernel Distributionally Robust Optimization☆13Feb 21, 2021Updated 5 years ago
- ☆23May 18, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Robust Bond Portfolio Construction via Convex-Concave Saddle Point Optimization☆14May 13, 2024Updated 2 years ago
- Data sources for Elastic Map Service☆23Jun 4, 2026Updated last week
- NoSQL extract, transform, load (ETL) toolkit with Python☆16Updated this week
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆78Sep 2, 2023Updated 2 years ago
- 🧱 A collection of supplementary utilities and helper notebooks to perform admin tasks on Databricks☆57Jul 4, 2025Updated 11 months ago
- TPC-H_SF10☆53Jan 20, 2025Updated last year
- Jupyter Cell / Line Magics for DuckDB☆59Apr 10, 2026Updated 2 months ago
- Python packages for Support Vector Regression with Linear Constraints☆10Jul 9, 2020Updated 5 years ago
- Clinical Pipeline Engine using Apache cTAKES☆24Nov 9, 2015Updated 10 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Accompanying code for our NeurIPS 2019 paper☆11Nov 7, 2019Updated 6 years ago
- Walmart version of the Linear Road streaming benchmark.☆22Mar 30, 2021Updated 5 years ago
- Repository for code samples from the book Mastering Azure Analytics☆25Apr 10, 2017Updated 9 years ago
- Passbolt CE installation scripts☆19Mar 16, 2021Updated 5 years ago
- A connector to ingest Azure Databricks lineage into Microsoft Purview☆93Apr 12, 2024Updated 2 years ago
- ORM for Apache Spark and DataFrames schema manager☆16Jun 24, 2024Updated last year
- fully custom stepper☆11Mar 28, 2022Updated 4 years ago
- DuckDB CronJob Extension☆50Mar 29, 2026Updated 2 months ago
- Classify images of different kitchenware items☆11Apr 17, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- the codes and some preliminary progress in the work of robust stochastic portfolio optimization☆11Oct 15, 2020Updated 5 years ago
- Public repository for .NET DevOps for Azure book manuscript☆26Jan 30, 2021Updated 5 years ago
- A password sharing plugin for KeePass.☆17Aug 31, 2019Updated 6 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆53Jun 17, 2025Updated 11 months ago
- Stream processing guidelines and examples using Apache Flink and Apache Spark☆44Apr 21, 2023Updated 3 years ago
- Codes for the paper "Residuals-based Distributionally Robust Optimization with Covariate Information"☆10Aug 13, 2022Updated 3 years ago
- This repo demonstrates an Apache Arrow Flight server implementation in Kubernetes.☆12Oct 25, 2024Updated last year