telia-oss / birgitta
Birgitta is a Python ETL test and schema framework, providing automated tests for pyspark notebooks/recipes.
☆14 · Updated last year
Alternatives and similar repositories for birgitta
Users who are interested in birgitta are comparing it to the libraries listed below.
- Medium Article ☆11 · Updated 4 years ago
- The best Python package for comparing two dataframes ☆12 · Updated 3 years ago
- A collection of python utility functions ☆11 · Updated last year
- [under development] ETL materials to support proposal for CDM enhancements for clinical trial data ☆24 · Updated 4 years ago
- ☆12 · Updated last year
- A collection of Pandas helper functions. ☆14 · Updated 2 years ago
- The PEDSnet Data Quality Assessment Toolkit (OMOP CDM) ☆25 · Updated 4 years ago
- Simple samples for writing ETL transform scripts in Python ☆23 · Updated last month
- A template for an AWS Lambda function that triggers Prefect Flow Runs ☆20 · Updated 4 years ago
- FHIR to OMOP using PySpark on AWS Glue ☆14 · Updated 4 years ago
- Parquet file management in S3 for Athena / Spectrum / Presto partitioning ☆22 · Updated 8 months ago
- Python utility to extract differences between two pandas dataframes. ☆11 · Updated 5 months ago
- ☆15 · Updated 4 years ago
- Extension to Python-Markdown to translate pydantic's model fields to markdown table ☆12 · Updated last year
- Function decorators for Pandas Dataframe column name and data type validation ☆19 · Updated this week
- Python package for managing OHDSI clinical data models. Includes support for LLM based plain text queries! ☆49 · Updated this week
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit… ☆63 · Updated last week
- Hephaestus - ETL and ML tools for OHDSI - OMOP CDM ☆13 · Updated last week
- ☆14 · Updated 6 months ago
- Omnipy is a high level Python library for type-driven data wrangling and scalable workflow orchestration (under development) ☆25 · Updated this week
- Outcomes Insights' Data Model for Clinical Research ☆19 · Updated last month
- ☆11 · Updated 4 years ago
- Fake Pandas / PySpark DataFrame creator ☆48 · Updated last year
- ☆15 · Updated last year
- Example project demonstrating deployment patterns for real-time streaming workflows with Prefect 2.0 ☆45 · Updated 3 years ago
- This connector is a dbt project that maps Medicare CCLF claims data to the Tuva Input Layer. ☆15 · Updated 6 months ago
- ☆10 · Updated 3 years ago
- ☆19 · Updated 5 years ago
- ⚡️ Pandas dataframes with object oriented programming style (not maintained) ☆11 · Updated last year
- A tool for faster application development Plotly Dash ☆10 · Updated 10 months ago