drivendataorg / cookiecutter-data-scienceLinks
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
☆9,337Updated last month
Alternatives and similar repositories for cookiecutter-data-science
Users that are interested in cookiecutter-data-science are comparing it to the libraries listed below
Sorting:
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,373Updated 7 months ago
- Missing data visualization module for Python.☆4,143Updated last year
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…☆10,560Updated last week
- Modin: Scale your Pandas workflows by changing a single line of code☆10,289Updated this week
- A library of extension and helper modules for Python's data analysis and machine learning libraries.☆5,073Updated 3 months ago
- Voilà turns Jupyter notebooks into standalone web applications☆5,823Updated last month
- 📚 Parameterize, execute, and analyze notebooks☆6,277Updated 3 months ago
- Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts☆7,001Updated last week
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner☆2,630Updated last year
- Dask tutorial☆1,859Updated last year
- Statsmodels: statistical modeling and econometrics in Python☆10,986Updated last week
- Tools for diffing and merging of Jupyter notebooks.☆2,778Updated last year
- An open source python library for automated feature engineering☆7,543Updated last week
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆13,170Updated this week
- Parallel computing with task scheduling☆13,509Updated 2 weeks ago
- cuDF - GPU DataFrame Library☆9,231Updated this week
- A library of sklearn compatible categorical variable encoders☆2,462Updated 3 months ago
- Feature engineering package with sklearn like functionality☆2,134Updated last month
- Source code for my collection of articles on using pandas.☆1,570Updated 2 years ago
- Pandas integration with sklearn☆2,842Updated 2 years ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,433Updated last week
- Create delightful software with Jupyter Notebooks☆5,189Updated 2 months ago
- RISE: "Live" Reveal.js Jupyter/IPython Slideshow Extension☆3,724Updated last year
- Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Fro…☆6,670Updated this week
- Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.☆2,449Updated 2 months ago
- A simple and efficient tool to parallelize Pandas operations on all available CPUs☆3,792Updated last year
- A Grammar of Graphics for Python☆4,374Updated last week
- A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning☆7,042Updated last month
- A scikit-learn compatible neural network library that wraps PyTorch☆6,118Updated last month
- 🦉 Data Versioning and ML Experiments☆14,948Updated this week