hipagesgroup / data-tools
Common Python tools and utilities for data engineering, ETL, exploration, etc., open-sourced and packaged for easy use in any environment.
☆13Updated 3 weeks ago
Alternatives and similar repositories for data-tools
Users who are interested in data-tools are comparing it to the libraries listed below.
- ☆12Updated last year
- Resources and documentation for UK Biobank to OMOP CDM v5.3.1 conversion☆9Updated 4 years ago
- Productivity Utilities for Data Science with Python Notebooks☆6Updated 5 years ago
- Hephaestus - ETL and ML tools for OHDSI - OMOP CDM☆13Updated 2 years ago
- Jupyter Notebooks and other code for 4CE data visualizations.☆13Updated 2 years ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach…☆19Updated 9 years ago
- ☆15Updated last year
- Birgitta is a Python ETL test and schema framework, providing automated tests for pyspark notebooks/recipes.☆14Updated last year
- ☆10Updated 3 years ago
- The PEDSnet Data Quality Assessment Toolkit (OMOP CDM)☆24Updated 4 years ago
- Extension to Python-Markdown to translate pydantic's model fields to markdown table☆13Updated last year
- Collection of various biomedical data models in parseable formats.☆29Updated 3 weeks ago
- Publication: Linked electronic health records for research on a nationwide cohort including over 54 million people in England☆18Updated 2 years ago
- Outcomes Insights' Data Model for Clinical Research☆19Updated last week
- ☆15Updated last year
- jinja2-enabled jupyter notebooks☆37Updated last week
- Medium Article☆11Updated 4 years ago
- Pandas ExtensionDtypes for dealing with genomics data☆47Updated 2 months ago
- ☆14Updated 5 months ago
- Self linked dictionary in Python☆8Updated last year
- A data science environment for Ubuntu 14.04 server and desktop☆14Updated 4 years ago
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in …☆23Updated 2 years ago
- This repo contains the draft, images, and code for the Medium blog post on Altair themes.☆12Updated 6 years ago
- [under development] ETL materials to support proposal for CDM enhancements for clinical trial data☆24Updated 4 years ago
- Clinical trial designs and methods in Python☆22Updated 8 years ago
- ☆17Updated 6 years ago
- Interactive Graphic for Exploring Liver Function Data in Clinical Trials☆11Updated 2 years ago
- A pattern focusing on how to use scikit learn and python in Watson Studio to predict opioid prescribers based off of a 2014 kaggle datase…☆36Updated 5 years ago
- Generates a set of plots showing ubiome data over time.☆13Updated 6 years ago
- NHS England PhD Internship Projects Pages☆19Updated 5 months ago