drivendataorg / cookiecutter-data-scienceLinks
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
☆9,251Updated last week
Alternatives and similar repositories for cookiecutter-data-science
Users that are interested in cookiecutter-data-science are comparing it to the libraries listed below
Sorting:
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆13,132Updated last week
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,375Updated 6 months ago
- Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts☆6,976Updated 2 weeks ago
- Missing data visualization module for Python.☆4,131Updated last year
- A library of extension and helper modules for Python's data analysis and machine learning libraries.☆5,064Updated 2 months ago
- Bayesian Modeling and Probabilistic Programming in Python☆9,247Updated this week
- An open source python library for automated feature engineering☆7,537Updated 2 weeks ago
- 📚 Parameterize, execute, and analyze notebooks☆6,261Updated 2 months ago
- Tools for diffing and merging of Jupyter notebooks.☆2,776Updated 11 months ago
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner☆2,630Updated last year
- An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks☆3,078Updated last year
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…☆10,526Updated last week
- Voilà turns Jupyter notebooks into standalone web applications☆5,807Updated 2 weeks ago
- Dask tutorial☆1,859Updated last year
- A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning☆7,034Updated 3 weeks ago
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆2,776Updated 4 months ago
- Pandas integration with sklearn☆2,842Updated 2 years ago
- Source code for my collection of articles on using pandas.☆1,568Updated 2 years ago
- Declarative visualization library for Python☆10,011Updated last week
- Kaggle Python docker image☆2,614Updated last week
- Plotting library for IPython/Jupyter notebooks☆3,667Updated 3 weeks ago
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,452Updated this week
- Automated Machine Learning with scikit-learn☆7,946Updated last week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,428Updated 2 weeks ago
- Python library that makes it easy for data scientists to create charts.☆3,611Updated 11 months ago
- Fast, flexible and easy to use probabilistic modelling in Python.☆3,472Updated 6 months ago
- A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.☆9,980Updated this week
- RISE: "Live" Reveal.js Jupyter/IPython Slideshow Extension☆3,723Updated last year
- Statsmodels: statistical modeling and econometrics in Python☆10,946Updated this week
- A scikit-learn compatible neural network library that wraps PyTorch☆6,113Updated last month