iterative / cookiecutter-data-science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
☆26 · Updated last year
Alternatives and similar repositories for cookiecutter-data-science:
Users who are interested in cookiecutter-data-science are comparing it to the libraries listed below:
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save. ☆75 · Updated last year
- Public repository for versioning machine learning data ☆42 · Updated 3 years ago
- Dockerized ML Cookiecutter ☆71 · Updated 2 years ago
- Kedro Wings automatically creates catalog entries to simplify Kedro pipeline writing. See the video here: https://www.youtube.com/watch?v… ☆22 · Updated 2 years ago
- The easiest way to integrate Kedro and Great Expectations ☆53 · Updated 2 years ago
- Dockerfiles for images used as part of the Orbyter toolset ☆44 · Updated 9 months ago
- A use case of a reproducible machine learning pipeline using Dask, DVC, and MLflow. ☆23 · Updated 5 years ago
- Altair backend for pandas plotting ☆102 · Updated 3 years ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr… ☆57 · Updated 3 years ago
- Dvc + Streamlit = ❤️ ☆40 · Updated last year
- Start a data science project with modern tools ☆189 · Updated last year
- Primrose modeling framework for simple production models ☆33 · Updated 11 months ago
- a python grammar for evolutionary algorithms and heuristics ☆189 · Updated 2 years ago
- A short tutorial for data scientists on how to write tests for code + data. ☆119 · Updated 4 years ago
- Automated Data Science and Machine Learning library to optimize workflow. ☆104 · Updated 2 years ago
- Automated Jupyter notebook testing. 📙 ☆41 · Updated last year
- Bulwark is a package for convenient property-based testing of pandas dataframes. ☆224 · Updated 4 years ago
- kedro cli plugin for generating a static kedro viz site (html, css, js) that can be deployed on many serverless tools. ☆27 · Updated 2 years ago
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background. ☆35 · Updated 2 years ago
- A collection of helpers for Jupyter/IPython ☆47 · Updated 3 years ago
- 💫 PyScaffold extension for data-science projects ☆156 · Updated this week
- ☆40 · Updated last year
- Marshmallow Schema generator for Pandas DataFrames ☆24 · Updated 4 years ago
- Decorators that log stats. ☆108 · Updated last year
- ☆24 · Updated last year
- A small python library that can clump lists of data together. ☆148 · Updated 3 years ago
- Fast, resilient and reproducible data analysis with cached SQL queries ☆30 · Updated last year
- A flexible template for doing reproducible data science in Python. ☆109 · Updated 9 months ago
- A toolbox 🧰 for Jupyter notebooks 📙: testing, experiment tracking, debugging, profiling, and more! ☆68 · Updated 5 months ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM ☆83 · Updated last year