drivendataorg / cookiecutter-data-science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
☆8,655Updated this week
Alternatives and similar repositories for cookiecutter-data-science:
Users that are interested in cookiecutter-data-science are comparing it to the libraries listed below
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,323Updated last month
- Voilà turns Jupyter notebooks into standalone web applications☆5,620Updated 2 weeks ago
- A library of extension and helper modules for Python's data analysis and machine learning libraries.☆4,982Updated last month
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆12,799Updated this week
- Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts☆6,783Updated last week
- Build, Deploy and Manage AI/ML Systems☆8,644Updated this week
- 📚 Parameterize, execute, and analyze notebooks☆6,113Updated 2 months ago
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner☆2,592Updated last year
- Tools for diffing and merging of Jupyter notebooks.☆2,716Updated 6 months ago
- Open source platform for the machine learning lifecycle☆19,824Updated this week
- Create delightful software with Jupyter Notebooks☆5,040Updated this week
- An open source python library for automated feature engineering☆7,393Updated this week
- Parallel computing with task scheduling☆13,045Updated this week
- A simple and efficient tool to parallelize Pandas operations on all available CPUs☆3,732Updated 8 months ago
- 🦉 Data Versioning and ML Experiments☆14,282Updated 2 weeks ago
- Source code for my collection of articles on using pandas.☆1,548Updated 2 years ago
- Missing data visualization module for Python.☆4,069Updated 10 months ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,357Updated 5 months ago
- Bayesian Modeling and Probabilistic Programming in Python☆8,921Updated this week
- Modin: Scale your Pandas workflows by changing a single line of code☆10,064Updated this week
- Fit interpretable models. Explain blackbox machine learning.☆6,426Updated last week
- A scikit-learn compatible neural network library that wraps PyTorch☆5,986Updated last week
- Plotting library for IPython/Jupyter notebooks☆3,650Updated last month
- A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C proje…☆23,226Updated this week
- Answers to 120 commonly asked data science interview questions.☆3,762Updated last year
- nbconvert as a web service: Render Jupyter Notebooks as static web pages☆2,235Updated 2 months ago
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…☆10,228Updated this week
- An open-source, low-code machine learning library in Python☆9,223Updated this week
- Interactive Widgets for the Jupyter Notebook☆3,207Updated 2 weeks ago
- ☆2,564Updated 2 years ago