Python Extract Transform and Load Tables of Data
☆1,309Aug 13, 2025Updated 6 months ago
Alternatives and similar repositories for petl
Users that are interested in petl are comparing it to the libraries listed below
Sorting:
- Optional extensions for petl based on third party libraries.☆44Jun 25, 2015Updated 10 years ago
- Official repository for pygrametl - ETL programming in Python☆299Updated this week
- Extract Transform Load for Python 3.5+☆1,609May 12, 2023Updated 2 years ago
- [NOT MAINTAINED] Bubbles – Python ETL framework☆459Oct 4, 2017Updated 8 years ago
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow☆2,086Dec 15, 2023Updated 2 years ago
- A curated list of awesome ETL frameworks, libraries, and software.☆3,520Jul 23, 2024Updated last year
- mito ETL tool☆163Jun 1, 2021Updated 4 years ago
- Data Migration for the Blaze Project☆1,005Jul 15, 2022Updated 3 years ago
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…☆18,681Feb 25, 2026Updated last week
- Data management framework for Python that provides functionality to describe, extract, validate, and transform tabular data☆807Dec 10, 2025Updated 2 months ago
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆21,697Updated this week
- An orchestration platform for the development, production, and observation of data assets.☆15,049Updated this week
- the portable Python dataframe library☆6,417Updated this week
- 🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library.…☆6,852Jan 28, 2026Updated last month
- A Python library for working with Table Schema.☆265Nov 14, 2024Updated last year
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…☆12,315Updated this week
- Parallel computing with task scheduling☆13,754Updated this week
- A Python stream processing engine modeled after Yahoo! Pipes☆1,601Dec 28, 2021Updated 4 years ago
- A Python package for manipulating 2-dimensional tabular data structures☆1,883Mar 17, 2025Updated 11 months ago
- Always know what to expect from your data.☆11,197Updated this week
- Computing with Python functions.☆4,324Feb 6, 2026Updated 3 weeks ago
- Python datetimes made easy☆6,620Feb 17, 2026Updated 2 weeks ago
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows☆44,430Updated this week
- An open source multi-tool for exploring and publishing data☆10,779Updated this week
- A suite of utilities for converting to and working with CSV, the king of tabular file formats.☆6,351Feb 10, 2026Updated 3 weeks ago
- A functional standard library for Python.☆5,118Jan 1, 2026Updated 2 months ago
- Easy-to-use data handling for SQL data stores with support for implicit table creation, bulk loading, and transactions.☆4,852Feb 5, 2025Updated last year
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin☆6,531Feb 9, 2026Updated 3 weeks ago
- Utils for streaming large files (S3, HDFS, gzip, bz2...)☆3,431Feb 23, 2026Updated last week
- NumPy and Pandas interface to Big Data☆3,197Sep 29, 2023Updated 2 years ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,475Feb 5, 2026Updated 3 weeks ago
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…☆10,771Updated this week
- Modin: Scale your Pandas workflows by changing a single line of code☆10,363Feb 10, 2026Updated 3 weeks ago
- Python Stream Processing☆6,830Jul 27, 2024Updated last year
- Apache Superset is a Data Visualization and Data Exploration Platform☆70,755Updated this week
- Real-time stream processing for python☆1,294Feb 24, 2026Updated last week
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆653Feb 4, 2026Updated 3 weeks ago
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.☆28,241Updated this week
- Data Contracts engine for the modern data stack. https://www.soda.io☆2,298Updated this week