socialpoint-labs / sqlbucket
Lightweight library to write, orchestrate and test your SQL ETL. Writing ETL with data integrity in mind.
☆74Updated last year
Alternatives and similar repositories for sqlbucket:
Users that are interested in sqlbucket are comparing it to the libraries listed below
- A free community driven school for data hosted at dataschool.com☆141Updated 2 years ago
- SQL data model for working with Snowplow web data. Supports Redshift and Looker. Snowflake and BigQuery coming soon☆60Updated 4 years ago
- An example mini data warehouse for python project stats, template for new projects☆178Updated 4 years ago
- Write Singer data to CSV files☆37Updated 7 months ago
- Visualize Airflow's schedule by exporting future DAG runs as events to Google Calendar.☆70Updated last year
- A luigi powered analytics / warehouse stack☆88Updated 8 years ago
- afctl helps to manage and deploy Apache Airflow projects faster and smoother.☆130Updated 2 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow.☆70Updated last year
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.☆79Updated 2 weeks ago
- A Getting Started Guide for developing and using Airflow Plugins☆93Updated 6 years ago
- Data pipelines from re-usable components☆108Updated 2 years ago
- Utilities for creating ETL pipelines with mara☆36Updated 2 years ago
- Data models for snowplow analytics.☆128Updated 2 months ago
- KNOTS is an intuitive desktop application built to simplify the configuration of Singer pipelines☆67Updated 2 years ago
- ☆27Updated 2 months ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆107Updated this week
- Convert a CSV to a parquet file.☆64Updated 2 years ago
- Tools for working with Singer Taps and Targets☆59Updated 7 months ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆261Updated last year
- A Singer.io tap for extracting data from the JIRA API☆35Updated 7 months ago
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable.☆169Updated last year
- Writes the Singer format from Python☆556Updated 3 weeks ago
- ☆110Updated 3 years ago
- ☆40Updated 3 years ago
- A python client library for the Stitch Import API☆42Updated last year
- Tough and flexible tools for data analysis, transformation, validation and movement.☆138Updated last year
- A SQLite vtable extension to read Parquet files☆270Updated 3 years ago
- Official repository for pygrametl - ETL programming in Python☆296Updated 2 weeks ago
- a declarative ETL framework that enforces data engineer best practices☆39Updated 7 years ago
- Lightweight configuration and access to multiple databases in a single project☆38Updated last year