telia-oss / birgittaLinks
Birgitta is a Python ETL test and schema framework, providing automated tests for pyspark notebooks/recipes.
☆14Updated last year
Alternatives and similar repositories for birgitta
Users that are interested in birgitta are comparing it to the libraries listed below
Sorting:
- Simple samples for writing ETL transform scripts in Python☆23Updated last month
- Extension to Python-Markdown to translate pydantic's model fields to markdown table☆12Updated last year
- FHIR to OMOP using PySpark on AWS Glue☆14Updated 4 years ago
- [under development] ETL materials to support proposal for CDM enhancements for clinical trial data☆24Updated 4 years ago
- The best Python package for comparing two dataframes☆12Updated 3 years ago
- A template for an AWS Lambda function that triggers Prefect Flow Runs☆20Updated 4 years ago
- Hephaestus - ETL and ML tools for OHDSI - OMOP CDM☆13Updated 2 years ago
- A collection of python utility functions☆11Updated last year
- ☆12Updated last year
- The PEDSnet Data Quality Assessment Toolkit (OMOP CDM)☆25Updated 4 years ago
- Python package for managing OHDSI clinical data models. Includes support for LLM based plain text queries!☆48Updated this week
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Updated 2 years ago
- Outcomes Insights' Data Model for Clinical Research☆19Updated 3 weeks ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊☆79Updated 11 months ago
- ☆10Updated 3 years ago
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit…☆61Updated last week
- Medium Article☆11Updated 4 years ago
- Make working with pandas data and AWS DynamoDB easy☆21Updated 7 months ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆61Updated this week
- Helper code to interact with Rasgo via our SDK, PyRasgo☆40Updated 2 years ago
- ☆14Updated 6 months ago
- ☆19Updated 5 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆35Updated 3 years ago
- ☆22Updated 11 months ago
- ☆11Updated 4 years ago
- Function decorators for Pandas Dataframe column name and data type validation☆18Updated 2 months ago
- Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully custom…☆44Updated last month
- ☆20Updated 2 years ago
- Connector that loads FHIR r4 USCDIv3 JSON data from local file storage into the Tuva common data model in Snowflake.☆27Updated last week
- ⚡️ Pandas dataframes with object oriented programming style (not maintained)☆11Updated last year