drivendataorg / cookiecutter-data-scienceLinks
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
☆9,191Updated last month
Alternatives and similar repositories for cookiecutter-data-science
Users that are interested in cookiecutter-data-science are comparing it to the libraries listed below
Sorting:
- Missing data visualization module for Python.☆4,126Updated last year
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,370Updated 6 months ago
- Create delightful software with Jupyter Notebooks☆5,169Updated last month
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆13,105Updated last week
- Voilà turns Jupyter notebooks into standalone web applications☆5,793Updated 3 weeks ago
- A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.☆9,968Updated 2 weeks ago
- Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts☆6,969Updated last week
- Tools for diffing and merging of Jupyter notebooks.☆2,772Updated 11 months ago
- A library of extension and helper modules for Python's data analysis and machine learning libraries.☆5,056Updated 2 months ago
- Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course☆10,578Updated last year
- An open source python library for automated feature engineering☆7,527Updated this week
- Data science interview questions and answers☆9,444Updated 4 months ago
- Statsmodels: statistical modeling and econometrics in Python☆10,900Updated last week
- A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)☆7,385Updated 10 months ago
- Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Fro…☆6,555Updated this week
- Declarative visualization library for Python☆9,959Updated last month
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️☆3,600Updated 2 months ago
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner☆2,625Updated last year
- 📚 Parameterize, execute, and analyze notebooks☆6,248Updated last month
- Modin: Scale your Pandas workflows by changing a single line of code☆10,262Updated last week
- Source code for my collection of articles on using pandas.☆1,566Updated 2 years ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,424Updated last week
- Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.☆2,426Updated 3 weeks ago
- A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems", which is `dm…☆9,538Updated 2 years ago
- Learn how to design, develop, deploy and iterate on production-grade ML applications.☆3,190Updated last year
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…☆10,504Updated this week
- A curated list of awesome MLOps tools☆4,688Updated last month
- Source code accompanying O'Reilly book: Machine Learning Design Patterns☆2,025Updated 4 years ago
- A Grammar of Graphics for Python☆4,329Updated last week
- A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning☆19,039Updated this week