Yet Another (Spark) ETL Framework
☆21 · Oct 21, 2023 · Updated 2 years ago
Alternatives and similar repositories for yetl
Users that are interested in yetl are comparing it to the libraries listed below.
- Delta Lake helper methods in PySpark ☆328 · Jan 19, 2026 · Updated 2 months ago
- Delta Lake Website ☆26 · Apr 1, 2026 · Updated 2 weeks ago
- A DataOps framework for building a lakehouse. ☆56 · Updated this week
- This PowerBI template that connects to the Azure Data Factory API to get information about the current status of your Datasets and Slices ☆22 · Apr 20, 2018 · Updated 7 years ago
- Manage Unity Catalog tables with Pydantic Models ☆10 · Mar 5, 2025 · Updated last year
- native Go library for Delta Lake ☆10 · Jul 31, 2022 · Updated 3 years ago
- Python Package for ducklake ☆20 · Jun 5, 2025 · Updated 10 months ago
- Collection of examples for showcasing various Rust graph data structure libraries. ☆29 · Aug 22, 2025 · Updated 7 months ago
- A stand-alone test framework that allows to write unit tests for Data Factory pipelines on Microsoft Fabric, Azure Data Factory and Azure… ☆130 · Jan 26, 2026 · Updated 2 months ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆16 · Oct 14, 2019 · Updated 6 years ago
- Spring Data Module for YugabyteDB. ☆18 · Aug 30, 2021 · Updated 4 years ago
- Schema Registry Statistics Tool ☆24 · Updated this week
- A unit test framework for Databricks notebooks ☆12 · Dec 8, 2020 · Updated 5 years ago
- Map your python dataclasses to pyspark types ☆10 · Feb 11, 2024 · Updated 2 years ago
- The Modern Data Stack in a (Smaller) Box ☆12 · Jan 28, 2023 · Updated 3 years ago
- Scripts for Azure Synapse SQL Pools (Provisioned) and Query-on-Demand (Serverless) ☆11 · Nov 2, 2021 · Updated 4 years ago
- Python code that will collapse structured columns separating out the attributes into new columns ☆10 · Mar 15, 2022 · Updated 4 years ago
- ☆15 · Apr 19, 2018 · Updated 7 years ago
- Code for the paper: Kernel Distributionally Robust Optimization ☆13 · Feb 21, 2021 · Updated 5 years ago
- Data Lineage for Spark components and PowerBI/AAS showing up in Azure Purview ☆19 · Jun 11, 2024 · Updated last year
- Publish / Deploy a Tabular or Multidimensional Cube to SSAS or AAS ☆11 · Jul 14, 2025 · Updated 9 months ago
- Project repository of Apache Airflow, deployed on Docker in Amazon EC2 via GitLab. ☆15 · Sep 3, 2021 · Updated 4 years ago
- Robust Bond Portfolio Construction via Convex-Concave Saddle Point Optimization ☆13 · May 13, 2024 · Updated last year
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin… ☆76 · Sep 2, 2023 · Updated 2 years ago
- 🧱 A collection of supplementary utilities and helper notebooks to perform admin tasks on Databricks ☆57 · Jul 4, 2025 · Updated 9 months ago
- TPC-H_SF10 ☆53 · Jan 20, 2025 · Updated last year
- Node.js and MySQL app ☆17 · Apr 9, 2016 · Updated 10 years ago
- Jupyter Cell / Line Magics for DuckDB ☆58 · Apr 6, 2026 · Updated last week
- Gatsby transformer plugin for jupyter notebooks ☆10 · Jan 7, 2019 · Updated 7 years ago
- Notebook Discovery Tool for Databricks notebooks ☆19 · Jul 14, 2022 · Updated 3 years ago
- Walmart version of the Linear Road streaming benchmark. ☆22 · Mar 30, 2021 · Updated 5 years ago
- A python SPark ETL libRary (SPETLR) for Databricks. https://discord.gg/p9bzqGybVW ☆24 · Mar 3, 2026 · Updated last month
- Passbolt CE installation scripts ☆19 · Mar 16, 2021 · Updated 5 years ago
- DuckDB Copilot Extension ☆10 · Jan 12, 2026 · Updated 3 months ago
- This is a `Rust` based package to help with the management of complex medicine (pill) management cycles. ☆27 · Dec 3, 2023 · Updated 2 years ago
- ORM for Apache Spark and DataFrames schema manager ☆16 · Jun 24, 2024 · Updated last year
- DuckDB CronJob Extension ☆47 · Mar 29, 2026 · Updated 2 weeks ago
- Classify images of different kitchenware items ☆11 · Apr 17, 2023 · Updated 2 years ago
- the codes and some preliminary progress in the work of robust stochastic portfolio optimization ☆11 · Oct 15, 2020 · Updated 5 years ago