iterative / cookiecutter-data-science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
☆27 Updated last year
Alternatives and similar repositories for cookiecutter-data-science
Users who are interested in cookiecutter-data-science are comparing it to the libraries listed below:
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save. ☆77 Updated last year
- A use case of a reproducible machine learning pipeline using Dask, DVC, and MLflow. ☆23 Updated 6 years ago
- captures logs and makes cron more fun ☆77 Updated 8 months ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr… ☆57 Updated 3 years ago
- Altair backend for pandas plotting ☆102 Updated 4 years ago
- Dockerized ML Cookiecutter ☆74 Updated 2 years ago
- Public repository for versioning machine learning data ☆42 Updated 3 years ago
- Dockerfiles for images used as part of the Orbyter toolset ☆44 Updated last year
- Kedro Wings automatically creates catalog entries to simplify Kedro pipeline writing. See the video here: https://www.youtube.com/watch?v… ☆22 Updated 2 years ago
- The easiest way to integrate Kedro and Great Expectations ☆52 Updated 2 years ago
- Summarise and explore Pandas DataFrames ☆98 Updated 4 years ago
- A small python library that can clump lists of data together. ☆149 Updated 3 years ago
- Automated Jupyter notebook testing. 📙 ☆41 Updated last year
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects ☆106 Updated 2 years ago
- A tool for compiling trained SKLearn models into other representations (such as SQL, Sympy or Excel formulas) ☆174 Updated 2 years ago
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background. ☆35 Updated 3 years ago
- Data exploration library with a pandas-like API ☆74 Updated 4 years ago
- 🎯 kettle is a CLI tool for creating and deploying cloud functions & docker containers for machine learning ☆32 Updated 2 years ago
- Decorators that log stats. ☆112 Updated 2 months ago
- Pandas Adapters For Scikit-Learn ☆53 Updated 6 years ago
- vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distr… ☆121 Updated 4 months ago
- SciKIt-learn Pipeline in PAndas ☆42 Updated last year
- Easy to use util for profiling in production ☆11 Updated last year
- A short tutorial for data scientists on how to write tests for code + data. ☆120 Updated 4 years ago
- Mini module with syntax sugar for pandas/sklearn ☆107 Updated 4 years ago
- A flexible template for doing reproducible data science in Python. ☆110 Updated last year
- Start a data science project with modern tools ☆196 Updated last year
- python library for automated dataset normalization ☆115 Updated last year
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan… ☆53 Updated 2 years ago
- Cluster tools for running Dask on Databricks ☆14 Updated last year