Yet Another (Spark) ETL Framework
☆21 · Oct 21, 2023 · Updated 2 years ago
Alternatives and similar repositories for yetl
Users that are interested in yetl are comparing it to the libraries listed below.
- Traditionally, engineers were needed to implement business logic via data pipelines before business users can start using it. Using this … ☆12 · Updated this week
- Manage Unity Catalog tables with Pydantic Models ☆10 · Mar 5, 2025 · Updated last year
- Python script to call Azure Retail Prices API and save the retail prices as an excel file ☆15 · Oct 14, 2021 · Updated 4 years ago
- Python Package for ducklake ☆20 · Jun 5, 2025 · Updated 11 months ago
- A stand-alone test framework that allows to write unit tests for Data Factory pipelines on Microsoft Fabric, Azure Data Factory and Azure… ☆130 · Jan 26, 2026 · Updated 3 months ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆16 · Oct 14, 2019 · Updated 6 years ago
- Spring Data Module for YugabyteDB. ☆18 · Aug 30, 2021 · Updated 4 years ago
- Schema Registry Statistics Tool ☆24 · Updated this week
- A unit test framework for Databricks notebooks ☆12 · Dec 8, 2020 · Updated 5 years ago
- The Modern Data Stack in a (Smaller) Box ☆12 · Jan 28, 2023 · Updated 3 years ago
- Visualize linear programming at https://lpviz.net ☆40 · May 4, 2026 · Updated 3 weeks ago
- Delta Lake Documentation ☆53 · Jun 19, 2024 · Updated last year
- 🚗 Downloads a Google Drive folder that you can query with gatsby-source-filesystem. ☆12 · Mar 2, 2023 · Updated 3 years ago
- Visual Studio Code Server on Azure Web App for Containers ☆10 · Apr 12, 2019 · Updated 7 years ago
- ☆11 · Oct 8, 2021 · Updated 4 years ago
- Publish / Deploy a Tabular or Multidimensional Cube to SSAS or AAS ☆11 · Jul 14, 2025 · Updated 10 months ago
- Robust Bond Portfolio Construction via Convex-Concave Saddle Point Optimization ☆14 · May 13, 2024 · Updated 2 years ago
- NoSQL extract, transform, load (ETL) toolkit with Python ☆16 · May 9, 2026 · Updated 2 weeks ago
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin… ☆78 · Sep 2, 2023 · Updated 2 years ago
- TPC-H_SF10 ☆53 · Jan 20, 2025 · Updated last year
- Node.js and MySQL app ☆17 · Apr 9, 2016 · Updated 10 years ago
- Gatsby transformer plugin for jupyter notebooks ☆10 · Jan 7, 2019 · Updated 7 years ago
- Jupyter Cell / Line Magics for DuckDB ☆59 · Apr 10, 2026 · Updated last month
- Python packages for Support Vector Regression with Linear Constraints ☆10 · Jul 9, 2020 · Updated 5 years ago
- Accompanying code for our NeurIPS 2019 paper ☆11 · Nov 7, 2019 · Updated 6 years ago
- Notebook Discovery Tool for Databricks notebooks ☆19 · Jul 14, 2022 · Updated 3 years ago
- Walmart version of the Linear Road streaming benchmark. ☆22 · Mar 30, 2021 · Updated 5 years ago
- DuckDB Copilot Extension ☆10 · Jan 12, 2026 · Updated 4 months ago
- This is a `Rust` based package to help with the management of complex medicine (pill) management cycles. ☆27 · Dec 3, 2023 · Updated 2 years ago
- A connector to ingest Azure Databricks lineage into Microsoft Purview ☆93 · Apr 12, 2024 · Updated 2 years ago
- ORM for Apache Spark and DataFrames schema manager ☆16 · Jun 24, 2024 · Updated last year
- DuckDB CronJob Extension ☆49 · Mar 29, 2026 · Updated last month
- the codes and some preliminary progress in the work of robust stochastic portfolio optimization ☆11 · Oct 15, 2020 · Updated 5 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o… ☆53 · Jun 17, 2025 · Updated 11 months ago
- Codes for the paper "Residuals-based Distributionally Robust Optimization with Covariate Information" ☆10 · Aug 13, 2022 · Updated 3 years ago
- This repo demonstrates an Apache Arrow Flight server implementation in Kubernetes. ☆12 · Oct 25, 2024 · Updated last year
- A python SPark ETL libRary (SPETLR) for Databricks. https://discord.gg/p9bzqGybVW ☆24 · Mar 3, 2026 · Updated 2 months ago
- Java client for EventStore (http://geteventstore.com) ☆20 · May 25, 2015 · Updated 11 years ago
- Submission for Redis 2021 Hackathon - Helsinki Regional Transit Tracking ☆21 · May 13, 2022 · Updated 4 years ago