amancevice / flatsplode
Flatten/Explode JSON objects
☆18 · Updated 9 months ago
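As a rough illustration of what "flatten/explode" can mean for a JSON object, here is a minimal standalone sketch. It does not use flatsplode and is not its implementation. The function names `flatten`, `explode`, and `flatten_explode`, and the `.` key separator, are assumptions chosen for this example.

```python
from itertools import product


def flatten(obj, parent="", sep="."):
    """Collapse nested dicts into one level, joining keys with `sep`."""
    flat = {}
    for key, value in obj.items():
        path = f"{parent}{sep}{key}" if parent else str(key)
        if isinstance(value, dict) and value:
            flat.update(flatten(value, path, sep))
        else:
            # Scalars, lists, and empty dicts are kept as-is.
            flat[path] = value
    return flat


def explode(obj):
    """Yield one dict per combination of elements of the top-level lists."""
    keys = list(obj)
    # Non-list values (and empty lists) are treated as a single choice.
    choices = [v if isinstance(v, list) and v else [v] for v in obj.values()]
    for combo in product(*choices):
        yield dict(zip(keys, combo))


def flatten_explode(obj):
    """Alternate flatten and explode until no non-empty lists remain."""
    pending = [obj]
    while pending:
        item = flatten(pending.pop())
        if any(isinstance(v, list) and v for v in item.values()):
            pending.extend(explode(item))
        else:
            yield item


record = {"id": 1, "tags": ["a", "b"], "meta": {"x": {"y": 2}}}
rows = sorted(flatten_explode(record), key=lambda r: r["tags"])
```

Each resulting row is a flat dict suitable for a table: nested keys become dotted paths, and every list element gets its own row. Rows are sorted here because the sketch does not guarantee output order.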
Alternatives and similar repositories for flatsplode:
Users who are interested in flatsplode are comparing it to the libraries listed below.
- dagster scikit-learn pipeline example. ☆45 · Updated 2 years ago
- Scripts to make specific datasets cleaner and more convenient ☆41 · Updated 2 years ago
- PySpark schema generator ☆42 · Updated 2 years ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro… ☆27 · Updated 2 years ago
- Ibis analytics, with Ibis (and more!) ☆21 · Updated 6 months ago
- A maximum-strength name parser for record linkage. ☆36 · Updated last month
- Full spreadsheet-style pivot table through SQL macros. Just specify values, rows, columns, and filters! ☆13 · Updated 6 months ago
- A PyTest plugin to speed up your tests which depend on Snowflake sessions ☆27 · Updated last year
- A tool to automatically infer columns data types in .csv files ☆35 · Updated 2 years ago
- quadipy is a python package to help transform structured data into RDF graph format ☆19 · Updated last year
- [DEPRECATED] A dbt adapter for Excel. ☆92 · Updated last year
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆49 · Updated last year
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️ ☆16 · Updated last week
- Cloud-agnostic Python API ☆61 · Updated 9 months ago
- A serverless duckDB deployment at GCP ☆38 · Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up ☆50 · Updated 4 years ago
- ☆16 · Updated 2 years ago
- Pandas helper functions ☆30 · Updated 2 years ago
- pytest plugin to run the tests with support of pyspark ☆86 · Updated last week
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data. ☆79 · Updated last week
- [ARCHIVED] The Presto adapter plugin for dbt Core ☆33 · Updated last year
- Composable filesystem hooks and operators for Apache Airflow. ☆17 · Updated 3 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python. ☆107 · Updated this week
- A tool to generate PySpark schema from JSON. ☆28 · Updated last year
- Code for data quality with greatexpectations blog ☆12 · Updated 7 months ago
- Pylint plugin for static code analysis on Airflow code ☆93 · Updated 4 years ago
- Fake Pandas / PySpark DataFrame creator ☆46 · Updated last year
- fst: flow state tool | smooth where you want it, friction where you need it when data engineering ☆34 · Updated last year
- makes your sql less bad ☆60 · Updated 5 years ago
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit… ☆54 · Updated this week