Wikia / discreETLy
ETLy is an add-on dashboard service on top of Apache Airflow.
☆69 Updated last year
Alternatives and similar repositories for discreETLy:
Users who are interested in discreETLy compare it to the libraries listed below.
- Pylint plugin for static code analysis on Airflow code ☆91 Updated 4 years ago
- [ARCHIVED] The Presto adapter plugin for dbt Core ☆33 Updated last year
- Fast iterative local development and testing of Apache Airflow workflows ☆195 Updated last month
- Search service library for Amundsen ☆54 Updated 8 months ago
- Airflow declarative DAGs via YAML ☆132 Updated last year
- REST-like API exposing Airflow data and operations ☆61 Updated 6 years ago
- Data models for snowplow analytics. ☆126 Updated last week
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform ☆261 Updated last year
- A Getting Started Guide for developing and using Airflow Plugins ☆94 Updated 6 years ago
- Astronomer Core Docker Images ☆106 Updated 8 months ago
- A facebook for data ☆26 Updated 5 years ago
- Metadata service library for Amundsen ☆83 Updated last year
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data. ☆41 Updated 3 weeks ago
- ☆197 Updated last year
- Data ingestion library for Amundsen to build graph and search index ☆205 Updated 10 months ago
- Visualize dependencies between Airflow DAGs ☆49 Updated 3 years ago
- Convert JSON files to Parquet using PyArrow ☆95 Updated last year
- A CLI and library to run Singer Taps and Targets ☆34 Updated 2 years ago
- Airflow workflow management platform chef cookbook. ☆70 Updated 5 years ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes ☆63 Updated 2 years ago
- SQL data model for working with Snowplow web data. Supports Redshift and Looker. Snowflake and BigQuery coming soon ☆60 Updated 4 years ago
- a dbt package to make auditing dbt runs easy. ☆98 Updated last month
- Data validation library for PySpark 3.0.0 ☆34 Updated 2 years ago
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data. ☆79 Updated this week
- ☆62 Updated 5 years ago
- The sane way of building a data layer in Airflow ☆24 Updated 5 years ago
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes. ☆24 Updated 6 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds. ☆87 Updated 10 months ago
- makes your sql less bad ☆60 Updated 4 years ago
- Rules based grant management for Snowflake ☆40 Updated 5 years ago