telia-oss / birgitta
Birgitta is a Python ETL test and schema framework, providing automated tests for pyspark notebooks/recipes.
☆14Updated last year
Related projects ⓘ
Alternatives and complementary repositories for birgitta
- ☆10Updated 3 years ago
- A collection of python utility functions☆12Updated 4 months ago
- Plugin for Intake to read from SQL servers☆15Updated last year
- Astronomer Vendor Images☆12Updated last week
- Helper code to interact with Rasgo via our SDK, PyRasgo☆40Updated last year
- Fully unit tested utility functions for data engineering. Python 3 only.☆14Updated 3 months ago
- Hephaestus - ETL and ML tools for OHDSI - OMOP CDM☆13Updated last year
- ☆12Updated 8 months ago
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆27Updated 2 years ago
- Content for a talk on "The wonderful world of data quality tools in Python"☆18Updated 3 years ago
- A fully-featured multi-source data pipeline for continuously extracting knowledge from COVID-19 data.☆21Updated 3 years ago
- This repository contains code to build an MVP search engine with google like interface.☆16Updated 4 years ago
- A template for an AWS Lambda function that triggers Prefect Flow Runs☆20Updated 3 years ago
- Build your feature store with macros right within your dbt repository☆37Updated last year
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro…☆27Updated 2 years ago
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight.☆11Updated 4 years ago
- This connector is a dbt project that maps Medicare CCLF claims data to the Tuva Input Layer.☆13Updated this week
- Repository containing various utils related to Snowflake migration at Faire.☆11Updated last year
- Full stack data engineering tools and infrastructure set-up☆44Updated 3 years ago
- Dask integration for Snowflake☆30Updated last week
- Utility functions for dbt projects running on Spark☆31Updated last year
- Activity Schema dbt package☆14Updated last year
- Using the Parquet file format with Python☆14Updated last year
- ☆15Updated 3 months ago
- Model drift detection☆11Updated last year
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10Updated last year
- The IBM DB2 adapter plugin for dbt (data build tool)☆10Updated 6 months ago
- a pytest plugin for dbt adapter test suites☆19Updated last year