telia-oss / birgittaLinks
Birgitta is a Python ETL test and schema framework, providing automated tests for pyspark notebooks/recipes.
☆14Updated last year
Alternatives and similar repositories for birgitta
Users that are interested in birgitta are comparing it to the libraries listed below
Sorting:
- ☆13Updated 3 weeks ago
- The PEDSnet Data Quality Assessment Toolkit (OMOP CDM)☆24Updated 4 years ago
- FHIR to OMOP using PySpark on AWS Glue☆14Updated 4 years ago
- ☆10Updated 3 years ago
- This connector is a dbt project that maps Medicare CCLF claims data to the Tuva Input Layer.☆15Updated 3 months ago
- Python package for managing OHDSI clinical data models. Includes support for LLM based plain text queries!☆45Updated 3 weeks ago
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit…☆59Updated this week
- Medium Article☆11Updated 4 years ago
- [under development] ETL materials to support proposal for CDM enhancements for clinical trial data☆24Updated 4 years ago
- Hephaestus - ETL and ML tools for OHDSI - OMOP CDM☆13Updated 2 years ago
- Cohort extractor tool which can generate dummy data, or real data against OpenSAFELY-compliant research databases☆38Updated 3 weeks ago
- 🔍Your Data Quality Detector / Gain insight into your data and get it ready for use before you start working with it 💡📊🛠💎☆16Updated 2 years ago
- ☆15Updated last year
- ☆15Updated 4 years ago
- SQL query executor on remote DuckDB instance using Apache Arrow Flight RPC through Streamlit Web interface.☆15Updated 8 months ago
- ☆12Updated last year
- ☆11Updated 8 months ago
- Examples for pretty-jupyter package.☆18Updated 2 years ago
- Cloud-agnostic Python API☆60Updated last year
- Simple samples for writing ETL transform scripts in Python☆23Updated last week
- CLI for data platform☆19Updated last year
- The best Python package for comparing two dataframes☆11Updated 3 years ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆12Updated last year
- Example project demonstrating deployment patterns for real-time streaming workflows with Prefect 2.0☆45Updated 2 years ago
- Outcomes Insights' Data Model for Clinical Research☆19Updated 2 months ago
- ☆19Updated 2 years ago
- Awesome List for Data Operations☆24Updated 4 years ago
- Connector that loads FHIR r4 USCDIv3 JSON data from local file storage into the Tuva common data model in Snowflake.☆27Updated last month
- Function decorators for Pandas Dataframe column name and data type validation☆18Updated 3 weeks ago
- Helper code to interact with Rasgo via our SDK, PyRasgo☆40Updated 2 years ago