jgoerner / data-science-stack-cookiecutter
Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyter, Superset, Postgres, Minio, Airflow & API Star)
☆213 · Updated last year
Alternatives and similar repositories for data-science-stack-cookiecutter
Users who are interested in data-science-stack-cookiecutter are comparing it to the repositories listed below.
- All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & … ☆156 · Updated 6 years ago
- Cookiecutter template for data scientists working with Docker containers ☆358 · Updated 3 years ago
- A web frontend for scheduling Jupyter notebook reports ☆253 · Updated 6 months ago
- Start a data science project with modern tools ☆197 · Updated last year
- Dockerized ML Cookiecutter ☆75 · Updated 2 years ago
- Manipulate pandas dataframes from the comfort of your browser ☆172 · Updated 3 years ago
- The easiest way to integrate Kedro and Great Expectations ☆52 · Updated 2 years ago
- Repo that relates to the Medium blog 'Keeping your ML model in shape with Kafka, Airflow and MLFlow' ☆121 · Updated 2 years ago
- Scaffold of Apache Airflow executing Docker containers ☆85 · Updated 2 years ago
- A flexible template for doing reproducible data science in Python. ☆110 · Updated last year
- PyScaffold extension for data-science projects ☆159 · Updated 2 months ago
- Python ELT Studio, an application for building ELT (and ETL) data flows. ☆58 · Updated 3 years ago
- Examples of data science projects created with Kedro. ☆172 · Updated 2 years ago
- Boilerplate for bootstrapping scalable multi-page Dash applications ☆255 · Updated 2 years ago
- Automated Data Science and Machine Learning library to optimize workflow. ☆104 · Updated 2 years ago
- Simple data science experimentation & tracking with jupyter, papermill, and mlflow. ☆183 · Updated 11 months ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save. ☆77 · Updated last year
- Tutorial-like code for how to deploy Airflow using Docker and how to use the DockerOperator. ☆44 · Updated 5 years ago
- Airflow basics tutorial ☆397 · Updated 3 years ago
- A Kedro plugin that provides pandas drop-in replacements for the pandas datasets (e.g. modin and cuDF) ☆12 · Updated 4 years ago
- Airflow tutorial for PyCon 2019 ☆86 · Updated 2 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ☆169 · Updated last year
- Template repository for data science lifecycle project ☆194 · Updated 4 years ago
- An example mini data warehouse for python project stats, template for new projects ☆179 · Updated 4 years ago
- A fork of the cookiecutter-data-science leveraging Docker for local development. ☆131 · Updated 5 years ago
- Jupyter notebook wrapper for plotly dash applications ☆81 · Updated 3 years ago
- Priceloop Engineering Conventions for Scala, Python, Git Workflow, etc. ☆100 · Updated 2 years ago
- ☆111 · Updated 5 months ago
- Trumania is a scenario-based random dataset generator library in python 3 ☆112 · Updated 3 years ago
- Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet ☆196 · Updated 2 years ago