A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
☆2,086Dec 15, 2023Updated 2 years ago
Alternatives and similar repositories for mara-pipelines
Users that are interested in mara-pipelines are comparing it to the libraries listed below
Sorting:
- Extract Transform Load for Python 3.5+☆1,609May 12, 2023Updated 2 years ago
- An example mini data warehouse for python project stats, template for new projects☆178Jul 21, 2020Updated 5 years ago
- A curated list of awesome ETL frameworks, libraries, and software.☆3,521Jul 23, 2024Updated last year
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…☆18,683Feb 28, 2026Updated last week
- Python Extract Transform and Load Tables of Data☆1,309Aug 13, 2025Updated 6 months ago
- An orchestration platform for the development, production, and observation of data assets.☆15,049Updated this week
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin☆6,535Updated this week
- Python Stream Processing☆6,828Jul 27, 2024Updated last year
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆21,782Updated this week
- Data-Centric Pipelines and Data Versioning☆6,287Feb 3, 2025Updated last year
- A Python stream processing engine modeled after Yahoo! Pipes☆1,601Dec 28, 2021Updated 4 years ago
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…☆12,345Updated this week
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.☆28,255Mar 2, 2026Updated last week
- Utilities for creating ETL pipelines with mara☆36May 20, 2022Updated 3 years ago
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows☆44,510Updated this week
- Data Pipeline Framework using the singer.io spec☆658Feb 26, 2026Updated last week
- ETL best practices with airflow, with examples☆1,354Sep 25, 2024Updated last year
- Quilt is a data mesh for connecting people with actionable data☆1,357Updated this week
- Official repository for pygrametl - ETL programming in Python☆299Updated this week
- Always know what to expect from your data.☆11,224Updated this week
- An open source multi-tool for exploring and publishing data☆10,805Feb 26, 2026Updated last week
- the portable Python dataframe library☆6,440Updated this week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…☆10,771Feb 26, 2026Updated last week
- The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lak…☆20,840Updated this week
- Curated list of resources about Apache Airflow☆3,896Jan 30, 2026Updated last month
- 📚 Parameterize, execute, and analyze notebooks☆6,390Feb 27, 2026Updated last week
- Parallel computing with task scheduling☆13,754Updated this week
- Python Fast Dataflow programming framework for Data pipeline work( Web Crawler,Machine Learning,Quantitative Trading.etc)☆1,198Feb 3, 2026Updated last month
- Apache Superset is a Data Visualization and Data Exploration Platform☆70,755Mar 2, 2026Updated last week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…☆4,744Mar 1, 2026Updated last week
- Writes the Singer format from Python☆576Feb 27, 2026Updated last week
- The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data☆46,267Mar 2, 2026Updated last week
- ☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️☆2,128Dec 29, 2025Updated 2 months ago
- Build, Manage and Deploy AI/ML Systems☆9,903Updated this week
- This repository is a getting started guide to Singer.☆1,329Aug 8, 2025Updated 7 months ago
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.☆5,543Sep 4, 2024Updated last year
- Docker Apache Airflow☆3,808Mar 1, 2023Updated 3 years ago
- Build powerful pipelines in any programming language.☆5,226Jan 10, 2026Updated last month
- A language and runtime for distributed, incremental data processing in the cloud☆976Oct 18, 2023Updated 2 years ago