drivendataorg / cookiecutter-data-science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
☆8,356Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for cookiecutter-data-science
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,293Updated last month
- Missing data visualization module for Python.☆3,963Updated 6 months ago
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆12,543Updated last week
- 📚 Parameterize, execute, and analyze notebooks☆5,977Updated last month
- Voilà turns Jupyter notebooks into standalone web applications☆5,465Updated 2 weeks ago
- Feature engineering package with sklearn like functionality☆1,927Updated last week
- Declarative statistical visualization library for Python☆9,384Updated this week
- Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts☆6,653Updated 2 months ago
- An open source python library for automated feature engineering☆7,272Updated this week
- A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.☆9,739Updated 3 months ago
- Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.☆2,312Updated 4 months ago
- A library of extension and helper modules for Python's data analysis and machine learning libraries.☆4,909Updated this week
- Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Fro…☆5,405Updated this week
- STUMPY is a powerful and scalable Python library for modern time series analysis☆3,666Updated this week
- Open Source Platform for developing, scaling and deploying serious ML, AI, and data science systems☆8,256Updated this week
- A simple and efficient tool to parallelize Pandas operations on all available CPUs☆3,686Updated 4 months ago
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.☆5,482Updated 2 months ago
- Tools for diffing and merging of Jupyter notebooks.☆2,677Updated last month
- Source code for my collection of articles on using pandas.☆1,539Updated last year
- Modin: Scale your Pandas workflows by changing a single line of code☆9,898Updated last month
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,299Updated last month
- A curated list of references for MLOps☆12,621Updated 5 months ago
- Automatically visualize your pandas dataframe via a single print! 📊 💡☆5,211Updated 8 months ago
- Parallel computing with task scheduling☆12,604Updated this week
- A Grammar of Graphics for Python☆4,048Updated this week
- A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning☆6,849Updated this week
- A guideline for building practical production-level deep learning systems to be deployed in real world applications.☆4,355Updated last year
- With Holoviews, your data visualizes itself.☆2,707Updated this week
- Fit interpretable models. Explain blackbox machine learning.☆6,297Updated this week
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner☆2,541Updated 8 months ago