socialpoint-labs / sqlbucket
Lightweight library to write, orchestrate and test your SQL ETL. Writing ETL with data integrity in mind.
☆74Updated last year
Alternatives and similar repositories for sqlbucket:
Users that are interested in sqlbucket are comparing it to the libraries listed below
- A free community driven school for data hosted at dataschool.com☆141Updated last year
- Utilities for creating ETL pipelines with mara☆36Updated 2 years ago
- afctl helps to manage and deploy Apache Airflow projects faster and smoother.☆130Updated 2 years ago
- Tools for working with Singer Taps and Targets☆59Updated 5 months ago
- Visualize Airflow's schedule by exporting future DAG runs as events to Google Calendar.☆70Updated last year
- KNOTS is an intuitive desktop application built to simplify the configuration of Singer pipelines☆67Updated 2 years ago
- An example mini data warehouse for python project stats, template for new projects☆178Updated 4 years ago
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables☆72Updated last year
- Data pipelines from re-usable components☆108Updated last year
- ☆27Updated 2 weeks ago
- A luigi powered analytics / warehouse stack☆87Updated 7 years ago
- A Getting Started Guide for developing and using Airflow Plugins☆94Updated 6 years ago
- Data analysis and reporting tool for quick access to custom charts and tables in Jupyter Notebooks and in the shell.☆120Updated last year
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆106Updated last week
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.☆79Updated this week
- ETLy is an add-on dashboard service on top of Apache Airflow.☆69Updated last year
- SQL data model for working with Snowplow web data. Supports Redshift and Looker. Snowflake and BigQuery coming soon☆60Updated 4 years ago
- Export Redshift data and convert to Parquet for use with Redshift Spectrum or other data warehouses.☆116Updated 2 years ago
- Write Singer data to CSV files☆37Updated 5 months ago
- Convert JSON files to Parquet using PyArrow☆95Updated last year
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable.☆169Updated last year
- Airflow workflow management platform chef cookbook.☆71Updated 5 years ago
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated last year
- Lightweight configuration and access to multiple databases in a single project☆38Updated last year
- Data Brewery is an ETL (Extract-Transform-Load) program that connect to many data sources (cloud services, databases, ...) and manage dat…☆16Updated 4 years ago
- A python client library for the Stitch Import API☆42Updated last year
- ☆73Updated this week
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆61Updated 2 years ago
- A curated collection of publicly available resources on dbt best practices and how data-driven organizations around the world utilize dbt☆113Updated 2 years ago
- Airflow plugin to transfer arbitrary files between operators☆78Updated 6 years ago