jgoerner / data-science-stack-cookiecutterLinks
π³ππ€Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyster, Superset, Postgres, Minio, AirFlow & API Star)
β213Updated 2 years ago
Alternatives and similar repositories for data-science-stack-cookiecutter
Users that are interested in data-science-stack-cookiecutter are comparing it to the libraries listed below
Sorting:
- Start a data science project with modern toolsβ199Updated last year
- Cookiecutter template for data scientists working with Docker containersβ358Updated 3 years ago
- A web frontend for scheduling Jupyter notebook reportsβ253Updated 8 months ago
- Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquetβ197Updated 2 years ago
- β111Updated 7 months ago
- ππ»π All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & β¦β156Updated 6 years ago
- scaffold of Apache Airflow executing Docker containersβ86Updated 2 years ago
- Dockerized ML Cookiecutterβ75Updated 2 years ago
- ππ¨ Airflow tutorial for PyCon 2019β85Updated 2 years ago
- manipulate pandas dataframes from the comfort of your browserβ174Updated 3 years ago
- The easiest way to integrate Kedro and Great Expectationsβ53Updated 2 years ago
- With single command build a beautiful web scraping tool for scheduled scraping and store scraped data in postgres databaseβ22Updated 2 weeks ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.β126Updated 4 years ago
- A frictionless integrated platform for notebookβ83Updated 2 years ago
- A flexible template for doing reproducible data science in Python.β110Updated last year
- python automatic data quality check toolkitβ282Updated 4 years ago
- A simple guide to understand Prefect and make it work with your own docker-compose configuration.β163Updated last year
- Example DAGs using hooks and operators from Airflow Pluginsβ346Updated 7 years ago
- JupyterHub extension for ContainDS Dashboardsβ201Updated 11 months ago
- Template repository for data science lifecycle projectβ194Updated 5 years ago
- a collection of resources and blogs about Apache Supersetβ86Updated 3 years ago
- Automated Data Science and Machine Learning library to optimize workflow.β104Updated 2 years ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.β80Updated last year
- This article compares open-source Python packages for pipeline/workflow development: Airflow, Luigi, Gokart, Metaflow, Kedro, PipelineX.β57Updated 5 years ago
- Test-Driven Data Analysis Functionsβ299Updated 2 weeks ago
- Materials for "Docker for Data Science" tutorial presented at PyCon 2018 in Cleveland, OHβ155Updated 4 years ago
- Examples of data science projects created with Kedro.β172Updated 2 years ago
- (project & tutorial) dag pipeline tests + ci/cd setupβ88Updated 4 years ago
- A library for recording and reading data in notebooks.β294Updated 3 years ago
- Containerized and Script-controlled JupyterLab Project Environmentβ106Updated 6 years ago