hackalog / easydata
A flexible template for doing reproducible data science in Python.
☆109 · Updated 11 months ago
Alternatives and similar repositories for easydata:
Users who are interested in easydata compare it to the libraries listed below:
- Up Your Bus Number: A Primer for Reproducible Data Science ☆68 · Updated 6 years ago
- Dockerized ML Cookiecutter ☆73 · Updated 2 years ago
- Bulwark is a package for convenient property-based testing of pandas dataframes. ☆224 · Updated 4 years ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save. ☆77 · Updated last year
- The easy way to write your own flavor of Pandas ☆303 · Updated last week
- Dockerfiles for images used as part of the Orbyter toolset ☆44 · Updated 11 months ago
- Summarise and explore Pandas DataFrames ☆98 · Updated 4 years ago
- A web frontend for scheduling Jupyter notebook reports ☆252 · Updated 4 months ago
- DataFrame support for scikit-learn. ☆63 · Updated last year
- A library that unifies the API for most commonly used libraries and modeling techniques for time-series forecasting in the Python ecosyst… ☆151 · Updated last year
- 💫 PyScaffold extension for data-science projects ☆158 · Updated 2 weeks ago
- Start a data science project with modern tools ☆193 · Updated last year
- A short tutorial for data scientists on how to write tests for code + data. ☆119 · Updated 4 years ago
- A Kedro plugin that provides pandas drop-in replacements for the pandas datasets (e.g. modin and cuDF) ☆12 · Updated 4 years ago
- Data exploration library with a pandas-like API ☆74 · Updated 4 years ago
- Sample projects using Ploomber. ☆86 · Updated last year
- vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distr… ☆120 · Updated 3 months ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM ☆84 · Updated last year
- Test-Driven Data Analysis Functions ☆298 · Updated last week
- Accelerate data science ☆116 · Updated 3 years ago
- stratx is a library for A Stratification Approach to Partial Dependence for Codependent Variables ☆66 · Updated 11 months ago
- This article compares open-source Python packages for pipeline/workflow development: Airflow, Luigi, Gokart, Metaflow, Kedro, PipelineX. ☆57 · Updated 4 years ago
- Clustergram - Visualization and diagnostics for cluster analysis in Python ☆123 · Updated last week
- Demo for voila ☆67 · Updated 2 years ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration ☆34 · Updated 4 years ago
- Marshmallow Schema generator for Pandas DataFrames ☆24 · Updated 4 years ago
- Decorators that log stats. ☆110 · Updated last month
- A Python grammar for evolutionary algorithms and heuristics ☆189 · Updated 3 years ago
- Tutorial for implementing data validation in data science pipelines ☆33 · Updated 2 years ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr… ☆57 · Updated 3 years ago