iterative / cookiecutter-data-science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
☆27 Updated last year
Alternatives and similar repositories for cookiecutter-data-science:
Users who are interested in cookiecutter-data-science are comparing it to the libraries listed below.
- A use case of a reproducible machine learning pipeline using Dask, DVC, and MLflow. ☆23 Updated 5 years ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr… ☆57 Updated 3 years ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save. ☆77 Updated last year
- Public repository for versioning machine learning data ☆42 Updated 3 years ago
- Summarise and explore Pandas DataFrames ☆98 Updated 4 years ago
- Dockerized ML Cookiecutter ☆73 Updated 2 years ago
- The easiest way to integrate Kedro and Great Expectations ☆53 Updated 2 years ago
- Altair backend for pandas plotting ☆102 Updated 4 years ago
- A toolbox 🧰 for Jupyter notebooks 📙: testing, experiment tracking, debugging, profiling, and more! ☆68 Updated 7 months ago
- Automated Jupyter notebook testing. 📙 ☆41 Updated last year
- captures logs and makes cron more fun ☆76 Updated 7 months ago
- kedro cli plugin for generating a static kedro viz site (html, css, js) that can be deployed on many serverless tools. ☆27 Updated 2 years ago
- A short tutorial for data scientists on how to write tests for code + data. ☆119 Updated 4 years ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM ☆84 Updated last year
- SciKIt-learn Pipeline in PAndas ☆42 Updated last year
- A flexible template for doing reproducible data science in Python. ☆109 Updated 11 months ago
- Primrose modeling framework for simple production models ☆32 Updated last year
- 💫 PyScaffold extension for data-science projects ☆158 Updated 3 weeks ago
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background. ☆35 Updated 3 years ago
- Bulwark is a package for convenient property-based testing of pandas dataframes. ☆224 Updated 4 years ago
- Data exploration library with a pandas-like API ☆74 Updated 4 years ago
- Decorators that log stats. ☆110 Updated last month
- Use pathlib syntax to easily work with Pandas series containing file paths. ☆69 Updated last year
- Dockerfiles for images used as part of the Orbyter toolset ☆44 Updated 11 months ago
- The goal of pandas-log is to provide feedback about basic pandas operations. It provides simple wrapper functions for the most common fun… ☆216 Updated 3 years ago
- Material for the Jupytext+Papermill blog post ☆31 Updated 4 years ago
- A collection of helpers for Jupyter/IPython ☆47 Updated 3 years ago
- 🍦 Deployment tool for online machine learning models ☆97 Updated 2 years ago
- Pandas Adapters For Scikit-Learn ☆53 Updated 6 years ago
- Kedro Wings automatically creates catalog entries to simplify Kedro pipeline writing. See the video here: https://www.youtube.com/watch?v… ☆22 Updated 2 years ago