telia-oss / birgitta
Birgitta is a Python ETL test and schema framework that provides automated tests for PySpark notebooks and recipes.
☆14 Updated last year
Related projects
Alternatives and complementary repositories for birgitta
- Astronomer Vendor Images ☆12 Updated this week
- ☆10 Updated 3 years ago
- Hephaestus - ETL and ML tools for OHDSI - OMOP CDM ☆13 Updated last year
- FHIR to OMOP using PySpark on AWS Glue ☆12 Updated 3 years ago
- Simple samples for writing ETL transform scripts in Python ☆22 Updated 3 years ago
- Helper code to interact with Rasgo via our SDK, PyRasgo ☆40 Updated last year
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included. ☆10 Updated last year
- Fully unit tested utility functions for data engineering. Python 3 only. ☆14 Updated 2 months ago
- The PEDSnet Data Quality Assessment Toolkit (OMOP CDM) ☆24 Updated 3 years ago
- [under development] ETL materials to support proposal for CDM enhancements for clinical trial data ☆21 Updated 3 years ago
- Omnipy is a high level Python library for type-driven data wrangling and scalable workflow orchestration (under development) ☆19 Updated this week
- Full stack data engineering tools and infrastructure set-up ☆41 Updated 3 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀 ☆25 Updated 2 years ago
- Outcomes Insights' Data Model for Clinical Research ☆17 Updated 8 months ago
- Extension to Python-Markdown to translate pydantic's model fields to markdown table ☆12 Updated 6 months ago
- Dask integration for Snowflake ☆30 Updated 4 months ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆51 Updated last week
- This connector is a dbt project that maps Medicare CCLF claims data to the Tuva Input Layer. ☆13 Updated this week
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ… ☆111 Updated this week
- Profiles the data, validates the schema and runs data quality checks and produces a report ☆20 Updated 5 years ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro… ☆26 Updated 2 years ago
- A collection of python utility functions ☆12 Updated 4 months ago
- Awesome Orchest projects, both official and submitted by the community. ☆25 Updated last year
- Awesome List for Data Operations ☆21 Updated 4 years ago
- A convenient but aesthetic way of creating a GANTT chart thanks to Plotly library (especially for everyone who doesn't want to do one). B… ☆12 Updated 3 years ago
- A collection of Pandas helper functions. ☆14 Updated last year
- Medium Article ☆11 Updated 3 years ago
- Fake Pandas / PySpark DataFrame creator ☆43 Updated 7 months ago