Codility / cookiecutter-data-science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
☆13 Updated 6 years ago
Alternatives and similar repositories for cookiecutter-data-science
Users that are interested in cookiecutter-data-science are comparing it to the libraries listed below
- Airflow tutorial for PyCon 2019 ☆88 Updated 3 years ago
- Techniques for Scraping the Web in Python ☆27 Updated 7 years ago
- A starter kit for new chapters of WiMLDS ☆43 Updated last year
- AWS Big Data Certification ☆25 Updated last year
- List of Machine Learning & Data Science Conferences ☆82 Updated 6 years ago
- Code to build a simple analytics data pipeline with Python ☆102 Updated 8 years ago
- A tutorial to build your first Flask application ☆77 Updated 2 years ago
- Performing sentiment analysis on WhatsApp chats. ☆23 Updated 8 years ago
- The goal of this repository is to detect the outliers for a dataset & see the impact of these outliers on predictive models ☆22 Updated 7 years ago
- This is a repo for all the tutorials put out by H2O.ai. This includes learning paths for Driverless AI, H2O-3, Sparkling Water and more… ☆134 Updated last year
- bamboolib - template for creating your own binder notebook ☆21 Updated 4 years ago
- Helper class to simplify common read-only BigQuery tasks. ☆110 Updated 3 months ago
- Processing tweets using Spark Streaming and identifying top trending hashtags using a real-time simple dashboard ☆42 Updated 3 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation. ☆36 Updated 6 years ago
- A Repository consisting of various visualisation libraries and tools ☆78 Updated 6 years ago
- Unpivoted and cleaned data sets on the COVID-19 pandemic ☆84 Updated 2 years ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A Python job will then be submitted to a Apach… ☆19 Updated 9 years ago
- ☆26 Updated 8 years ago
- A collection of handy cheat sheets for programming, data science, version control, statistics, and probability. ☆80 Updated 7 years ago
- Shapley Values with H2O AutoML Example (ML Interpretability) ☆19 Updated 6 years ago
- A blog post about report generation and automation in Python ☆40 Updated 6 years ago
- ☆38 Updated 8 years ago
- A Jupyter notebook that uses the Watson Visual Recognition and Natural Language Understanding services to enrich Facebook Analytics and u… ☆44 Updated last year
- ☆19 Updated 10 years ago
- Starter repository for Manning PBC: Discovering and Tracking Disease Outbreaks with Data Science and Python ☆18 Updated 3 years ago
- Live Twitter sentiment analysis using Python, Apache Spark Streaming, Kafka, NLTK, SocketIO ☆20 Updated 8 years ago
- Materials for "Docker for Data Science" tutorial presented at PyCon 2018 in Cleveland, OH ☆160 Updated 5 years ago
- Basic tutorial of using Apache Airflow ☆36 Updated 7 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming ☆55 Updated 7 years ago
- Production ready templates for deploying Driverless AI (DAI) scorers. https://h2oai.github.io/dai-deployment-templates/ ☆18 Updated 5 months ago