Yet Another (Spark) ETL Framework
☆21Oct 21, 2023Updated 2 years ago
Alternatives and similar repositories for yetl
Users that are interested in yetl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Delta Lake helper methods in PySpark☆328Jan 19, 2026Updated 2 months ago
- This PowerBI template that connects to the Azure Data Factory API to get information about the current status of your Datasets and Slices☆22Apr 20, 2018Updated 7 years ago
- native Go library for Delta Lake☆10Jul 31, 2022Updated 3 years ago
- Collection of examples for showcasing various Rust graph data structure libraries.☆29Aug 22, 2025Updated 7 months ago
- A stand-alone test framework that allows to write unit tests for Data Factory pipelines on Microsoft Fabric, Azure Data Factory and Azure…☆130Jan 26, 2026Updated last month
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆16Oct 14, 2019Updated 6 years ago
- Schema Registry Statistics Tool☆24Updated this week
- A unit test framework for Databricks notebooks☆12Dec 8, 2020Updated 5 years ago
- The Modern Data Stack in a (Smaller) Box☆12Jan 28, 2023Updated 3 years ago
- Map your python dataclasses to pyspark types☆10Feb 11, 2024Updated 2 years ago
- minio as local storage and DynamoDB as catalog☆15May 14, 2024Updated last year
- Visualize linear programming at https://lpviz.net☆33Jan 20, 2026Updated 2 months ago
- Python code that will collapse structured columns separating out the attributes into new columns☆10Mar 15, 2022Updated 4 years ago
- Delta Lake Documentation☆53Jun 19, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 🚗 Downloads a Google Drive folder that you can query with gatsby-source-filesystem.☆12Mar 2, 2023Updated 3 years ago
- Visual Studio Code Server on Azure Web App for Containers☆10Apr 12, 2019Updated 6 years ago
- Data Lineage for Spark components and PowerBI/AAS showing up in Azure Purview☆19Jun 11, 2024Updated last year
- ☆11Oct 8, 2021Updated 4 years ago
- Project repository of Apache Airflow, deployed on Docker in Amazon EC2 via GitLab.☆15Sep 3, 2021Updated 4 years ago
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆75Sep 2, 2023Updated 2 years ago
- 🧱 A collection of supplementary utilities and helper notebooks to perform admin tasks on Databricks☆57Jul 4, 2025Updated 8 months ago
- TPC-H_SF10☆53Jan 20, 2025Updated last year
- Node.js and MySQL app☆17Apr 9, 2016Updated 9 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Gatsby transformer plugin for jupyter notebooks☆10Jan 7, 2019Updated 7 years ago
- Clinical Pipeline Engine using Apache cTAKES☆24Nov 9, 2015Updated 10 years ago
- Notebook Discovery Tool for Databricks notebooks☆19Jul 14, 2022Updated 3 years ago
- DuckDB Copilot Extension☆10Jan 12, 2026Updated 2 months ago
- Passbolt CE installation scripts☆19Mar 16, 2021Updated 5 years ago
- DuckDB CronJob Extension☆47Feb 18, 2026Updated last month
- ORM for Apache Spark and DataFrames schema manager☆16Jun 24, 2024Updated last year
- Classify images of different kitchenware items☆11Apr 17, 2023Updated 2 years ago
- the codes and some preliminary progress in the work of robust stochastic portfolio optimization☆11Oct 15, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆52Jun 17, 2025Updated 9 months ago
- Codes for the paper "Residuals-based Distributionally Robust Optimization with Covariate Information"☆10Aug 13, 2022Updated 3 years ago
- Java client for EventStore (http://geteventstore.com)☆20May 25, 2015Updated 10 years ago
- Submission for Redis 2021 Hackathon - Helsinki Regional Transit Tracking☆21May 13, 2022Updated 3 years ago
- A collection of CLI LLM tools that I built and use daily☆15Aug 7, 2024Updated last year
- Get map value via dot-delimited path or nil.☆30Sep 9, 2014Updated 11 years ago
- ☆13May 3, 2022Updated 3 years ago