hipagesgroup / data-tools
Common Python tools and utilities for data engineering, ETL, exploration, etc., made open source and packaged for easy use in any environment.
☆13Updated this week
Alternatives and similar repositories for data-tools
Users who are interested in data-tools are comparing it to the libraries listed below
- ☆12Updated last year
- Outcomes Insights' Data Model for Clinical Research☆18Updated 3 weeks ago
- Resources and documentation for UK Biobank to OMOP CDM v5.3.1 conversion☆9Updated 4 years ago
- ☆10Updated 3 years ago
- Techniques for Scraping the Web in Python☆26Updated 6 years ago
- A data science environment for Ubuntu 14.04 server and desktop☆14Updated 4 years ago
- ☆9Updated 3 months ago
- A minimal example of how to use streamlit on Heroku☆21Updated 4 years ago
- Boring ML Generated Site☆19Updated 2 years ago
- Jupyter Notebooks and other code for 4CE data visualizations.☆13Updated 2 years ago
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight.☆11Updated 4 years ago
- Hephaestus - ETL and ML tools for OHDSI - OMOP CDM☆13Updated 2 years ago
- Add-on package for using the Gridster library with Shiny☆25Updated 9 years ago
- Enables creating an AWS Lambda package that bundles R and a Python Lambda function for calculating survival statistics☆24Updated 8 years ago
- A Plotly Dash application to interact with the API endpoints of openFDA☆22Updated 4 years ago
- 🔍Your Data Quality Detector / Gain insight into your data and get it ready for use before you start working with it 💡📊🛠💎☆16Updated 2 years ago
- This repo contains the draft, images, and code for the Medium blog post on Altair themes.☆12Updated 6 years ago
- articat: data artifact catalog☆17Updated 3 months ago
- Getting Great Expectations setup to run on DataBricks with Spark Dataframes.☆13Updated 2 years ago
- Helper code to interact with Rasgo via our SDK, PyRasgo☆40Updated 2 years ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A Python job will then be submitted to an Apach…☆19Updated 8 years ago
- A collection of data science examples implemented across a variety of languages and libraries.☆33Updated 9 years ago
- This code is to demonstrate the use of esquisse to generate ggplot2 with drag and drop☆9Updated 6 years ago
- ☆7Updated 6 years ago
- Easy Interactive Data Profiling for Big Data (and Small Data)☆14Updated 10 years ago
- Content for healthcare.ai, old posts, some hosted notebooks☆14Updated 7 years ago
- An R package for performing empirical calibration of observational study estimates☆10Updated 3 months ago
- A paper comparing Dask and Spark☆10Updated 2 years ago
- Cohort extractor tool which can generate dummy data, or real data against OpenSAFELY-compliant research databases☆38Updated 3 weeks ago
- ☆19Updated 10 months ago