a set of scripts to pull meta data and data profiling metrics from relational database systems
☆77Apr 17, 2024Updated last year
Alternatives and similar repositories for data-profiling
Users that are interested in data-profiling are comparing it to the libraries listed below
Sorting:
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆36Jul 13, 2019Updated 6 years ago
- ETL-CDMBuilder is a repo containing a .NET Core application to perform ETL to OMOP CDM for multiple databases☆54Jan 28, 2026Updated last month
- ☆29Oct 16, 2022Updated 3 years ago
- ☆15May 31, 2023Updated 2 years ago
- Showing the relationship between ImageNet ID and labels and pytorch pre-trained model output ID and labels☆10Oct 11, 2020Updated 5 years ago
- Templates and codebase to allow for an accelerated implementation of data governance capabilities using Microsoft's data governance produ…☆18May 27, 2022Updated 3 years ago
- A platform to manage the data product life cycle☆22Feb 11, 2026Updated 2 weeks ago
- customer visualization for splunk using echarts☆15May 11, 2017Updated 8 years ago
- Content for healthcare.ai, old posts, some hosted notebooks☆14Jan 19, 2018Updated 8 years ago
- Slides on making interactive leaflet maps with R☆18Sep 3, 2025Updated 5 months ago
- BETL. Meta data driven ETL generation using T-SQL☆18Jun 29, 2022Updated 3 years ago
- A handful of SSIS Catalog reports ready for SSRS☆31Dec 21, 2021Updated 4 years ago
- Creates an Excel Document with the metadata of all objects in a SSAS Tabular Model.☆18Sep 20, 2021Updated 4 years ago
- SCD Merge Wizard is an application which will help you generate T-SQL statement for merging data from two tables into one table in minute…☆44Sep 4, 2024Updated last year
- Outcomes Insights' Data Model for Clinical Research☆19Aug 12, 2025Updated 6 months ago
- This is an R package that implements a library of standard queries that run against the OMOP-CDM.☆18Jun 7, 2024Updated last year
- Scripts for SQL Server☆49Jan 26, 2025Updated last year
- PowerShell module to deploy Synapse workspace (and more) in Microsoft Azure.☆25Aug 26, 2025Updated 6 months ago
- Edit Open Data Contract Standard in Excel☆35Dec 1, 2025Updated 3 months ago
- ☆29Jul 15, 2023Updated 2 years ago
- workshop for building mssql always on basic availability group on window and Linux. Build Amazon FSx for managed shared file service, AWS…☆24Aug 9, 2021Updated 4 years ago
- Computation of adherence to medications from Electronic Healthcare Data in R☆31Jan 15, 2026Updated last month
- Terraform module which creates Snowflake RBAC resources using a simple configuration model. DISCLAIMER: Please see the following module t…☆12Jul 3, 2023Updated 2 years ago
- A framework for moving data into a data warehouse.☆56Sep 7, 2021Updated 4 years ago
- SSIS Multiple Hash makes it possible to generate many Hash values from each input row. Hash's supported include MD5 and SHA1.☆37Mar 15, 2023Updated 2 years ago
- Useful scripts for working with Microsoft SQL Server☆29Aug 14, 2019Updated 6 years ago
- Statistical modeling lies at the heart of data science. Well crafted statistical models allow data scientists to draw conclusions about t…☆11Jan 21, 2026Updated last month
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆28Sep 20, 2021Updated 4 years ago
- ☆12May 28, 2024Updated last year
- Multi-Perspective Relevance Matching with Hierarchical ConvNets for Social Media Search (Rao et al. AAAI'19)☆27Nov 21, 2022Updated 3 years ago
- R package to easily interface with OMOP-formatted EHR data.☆37Mar 4, 2020Updated 5 years ago
- ☆11Dec 17, 2025Updated 2 months ago
- Various machine learning approaches are widely applied for short-term solar power forecasting, which is highly demanded for renewable ene…☆13Feb 18, 2020Updated 6 years ago
- A big collection of SQL Server Queries and documeantations to fix your SQL Server's bottle neck☆76Dec 11, 2017Updated 8 years ago
- Code files used for Insight Quest blog posts☆83Jun 21, 2021Updated 4 years ago
- Functions to map between ICD-10 terms and PheCodes for UK Biobank hospital electronic health records☆39Dec 27, 2022Updated 3 years ago
- Sample data files to support practice exercises within SAS certification preparation guides.☆39Aug 7, 2024Updated last year
- Data engineering interviews Q&A for data community by data community☆66Jun 7, 2020Updated 5 years ago
- Compilation of Scripts (mostly SQL ones) used to administer SQL Server☆10Jun 24, 2022Updated 3 years ago