petl-developers / petl
Python Extract Transform and Load Tables of Data
☆1,264 · Updated 2 weeks ago
Alternatives and similar repositories for petl:
Users interested in petl compare it with the libraries listed below.
- Official repository for pygrametl - ETL programming in Python ☆296 · Updated 2 weeks ago
- Extract Transform Load for Python 3.5+ ☆1,590 · Updated last year
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow ☆2,079 · Updated last year
- Data Migration for the Blaze Project ☆1,004 · Updated 2 years ago
- [NOT MAINTAINED] Bubbles – Python ETL framework ☆452 · Updated 7 years ago
- A Python data analysis library that is optimized for humans instead of machines. ☆1,177 · Updated last month
- Writes the Singer format from Python ☆556 · Updated 3 weeks ago
- A Python library for working with Table Schema. ☆262 · Updated 5 months ago
- ETL best practices with Airflow, with examples ☆1,330 · Updated 6 months ago
- Template Language for SQL with Automatic Bind Parameter Extraction ☆832 · Updated last year
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with … ☆631 · Updated last week
- Python implementation of the Parquet columnar file format. ☆822 · Updated 3 weeks ago
- [NOT MAINTAINED] Light-weight Python OLAP framework for multi-dimensional data analysis ☆1,484 · Updated 2 years ago
- CONTRIBUTIONS ONLY: Voluptuous, despite the name, is a Python data validation library. ☆1,831 · Updated 8 months ago
- A curated list of awesome ETL frameworks, libraries, and software. ☆3,388 · Updated 8 months ago
- Simple DAG-based job scheduler in Python ☆765 · Updated 5 years ago
- NumPy and Pandas interface to Big Data ☆3,199 · Updated last year
- Easy pipelines for pandas DataFrames. ☆719 · Updated 5 months ago
- Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark ☆1,503 · Updated 4 months ago
- sqldf for pandas ☆1,346 · Updated 8 months ago
- Data management framework for Python that provides functionality to describe, extract, validate, and transform tabular data ☆745 · Updated last week
- db.py is an easier way to interact with your databases ☆1,215 · Updated 3 years ago
- ☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️ ☆1,987 · Updated 3 months ago
- Useful extensions to the standard Python datetime features ☆2,442 · Updated 2 weeks ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python ☆997 · Updated last year
- Python disk-backed cache (Django-compatible). Faster than Redis and Memcached. Pure-Python. ☆2,522 · Updated 8 months ago
- Lightweight, extensible data validation library for Python ☆3,198 · Updated 3 months ago
- Fast Avro for Python ☆665 · Updated last week
- A library for defensive data analysis. ☆500 · Updated 5 years ago
- Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more! ☆550 · Updated this week