drivendataorg / cookiecutter-data-science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
☆8,490Updated this week
Alternatives and similar repositories for cookiecutter-data-science:
Users that are interested in cookiecutter-data-science are comparing it to the libraries listed below
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,306Updated 3 months ago
- Declarative visualization library for Python☆9,518Updated this week
- 📚 Parameterize, execute, and analyze notebooks☆6,047Updated last week
- Create delightful software with Jupyter Notebooks☆4,986Updated 3 weeks ago
- 🦉 Data Versioning and ML Experiments☆14,088Updated this week
- Voilà turns Jupyter notebooks into standalone web applications☆5,538Updated last week
- Statsmodels: statistical modeling and econometrics in Python☆10,358Updated last week
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆12,647Updated this week
- Modin: Scale your Pandas workflows by changing a single line of code☆9,975Updated last week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,321Updated 3 months ago
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner☆2,559Updated 9 months ago
- A light-weight, flexible, and expressive statistical data testing library☆3,546Updated this week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…☆10,112Updated this week
- A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)☆7,287Updated 3 months ago
- Missing data visualization module for Python.☆3,999Updated 8 months ago
- A library of extension and helper modules for Python's data analysis and machine learning libraries.☆4,941Updated 2 months ago
- Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts☆6,710Updated 3 weeks ago
- Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce,…☆27,724Updated 9 months ago
- A simple and efficient tool to parallelize Pandas operations on all available CPUs☆3,716Updated 6 months ago
- Source code for my collection of articles on using pandas.☆1,542Updated 2 years ago
- aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-firs…☆27,113Updated 6 months ago
- Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.☆2,348Updated 2 weeks ago
- Automatically visualize your pandas dataframe via a single print! 📊 💡☆5,241Updated 9 months ago
- Data science interview questions and answers☆9,090Updated 4 months ago
- An open source python library for automated feature engineering☆7,334Updated this week
- Tools for diffing and merging of Jupyter notebooks.☆2,690Updated 3 months ago
- cuDF - GPU DataFrame Library☆8,597Updated this week
- Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course☆10,328Updated 9 months ago
- A Grammar of Graphics for Python☆4,106Updated last week
- Panel: The powerful data exploration & web app framework for Python☆4,952Updated this week