Python Extract Transform and Load Tables of Data
☆1,312 · Aug 13, 2025 · Updated 7 months ago
Alternatives and similar repositories for petl
Users that are interested in petl are comparing it to the libraries listed below.
- Optional extensions for petl based on third party libraries. ☆44 · Jun 25, 2015 · Updated 10 years ago
- Official repository for pygrametl - ETL programming in Python ☆299 · Mar 10, 2026 · Updated 2 weeks ago
- Extract Transform Load for Python 3.5+ ☆1,610 · May 12, 2023 · Updated 2 years ago
- [NOT MAINTAINED] Bubbles – Python ETL framework ☆460 · Oct 4, 2017 · Updated 8 years ago
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow ☆2,084 · Dec 15, 2023 · Updated 2 years ago
- mito ETL tool ☆163 · Jun 1, 2021 · Updated 4 years ago
- A curated list of awesome ETL frameworks, libraries, and software. ☆3,522 · Mar 7, 2026 · Updated 2 weeks ago
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis… ☆18,694 · Mar 7, 2026 · Updated 2 weeks ago
- Data Migration for the Blaze Project ☆1,004 · Jul 15, 2022 · Updated 3 years ago
- Data management framework for Python that provides functionality to describe, extract, validate, and transform tabular data ☆808 · Dec 10, 2025 · Updated 3 months ago
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python. ☆21,910 · Updated this week
- An orchestration platform for the development, production, and observation of data assets. ☆15,134 · Updated this week
- 🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library.… ☆6,859 · Mar 6, 2026 · Updated 2 weeks ago
- A Python library for working with Table Schema. ☆266 · Nov 14, 2024 · Updated last year
- A Python stream processing engine modeled after Yahoo! Pipes ☆1,600 · Dec 28, 2021 · Updated 4 years ago
- the portable Python dataframe library ☆6,457 · Updated this week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application… ☆12,429 · Updated this week
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows ☆44,696 · Updated this week
- Parallel computing with task scheduling ☆13,774 · Updated this week
- Always know what to expect from your data. ☆11,280 · Updated this week
- A Python package for manipulating 2-dimensional tabular data structures ☆1,882 · Mar 17, 2025 · Updated last year
- A suite of utilities for converting to and working with CSV, the king of tabular file formats. ☆6,360 · Mar 5, 2026 · Updated 2 weeks ago
- Python datetimes made easy ☆6,632 · Mar 6, 2026 · Updated 2 weeks ago
- A functional standard library for Python. ☆5,128 · Jan 1, 2026 · Updated 2 months ago
- Computing with Python functions. ☆4,333 · Mar 3, 2026 · Updated 3 weeks ago
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with … ☆655 · Mar 1, 2026 · Updated 3 weeks ago
- An open source multi-tool for exploring and publishing data ☆10,839 · Updated this week
- NumPy and Pandas interface to Big Data ☆3,195 · Sep 29, 2023 · Updated 2 years ago
- Intake is a lightweight package for finding, investigating, loading and disseminating data. ☆1,071 · Mar 9, 2026 · Updated 2 weeks ago
- ETL best practices with airflow, with examples ☆1,352 · Sep 25, 2024 · Updated last year
- Apache Superset is a Data Visualization and Data Exploration Platform ☆71,049 · Updated this week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s… ☆8,492 · Mar 1, 2026 · Updated 3 weeks ago
- Modin: Scale your Pandas workflows by changing a single line of code ☆10,364 · Feb 10, 2026 · Updated last month
- Easy-to-use data handling for SQL data stores with support for implicit table creation, bulk loading, and transactions. ☆4,853 · Feb 5, 2025 · Updated last year
- Python Stream Processing ☆6,822 · Jul 27, 2024 · Updated last year
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin ☆6,554 · Mar 16, 2026 · Updated last week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting… ☆4,749 · Mar 10, 2026 · Updated 2 weeks ago
- [NOT MAINTAINED] Light-weight Python OLAP framework for multi-dimensional data analysis ☆1,479 · Apr 29, 2022 · Updated 3 years ago
- Extract, Transform, Load: Any SQL Database in 4 lines of Code. ☆556 · May 23, 2019 · Updated 6 years ago