d6t / d6tflow-template
Project template for highly effective data science workflows
☆29Updated 11 months ago
Alternatives and similar repositories for d6tflow-template:
Users that are interested in d6tflow-template are comparing it to the libraries listed below
- ☆19Updated 4 years ago
- Hands on Unsupervised Learning with Python [Video], Published by Packt☆29Updated 2 years ago
- ☆14Updated 2 years ago
- Jupyter notebooks for learning Python and Data Science, companion to Data Science Solutions book.☆36Updated 4 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- Start your journey into social media analysis of politicans by using Python (Tutorial)☆21Updated 5 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆36Updated 5 years ago
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017☆12Updated 8 years ago
- Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deploym…☆61Updated 2 years ago
- Repo for PyData 2019 Tutorial - New Trends in Estimation and Inference☆25Updated 5 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- Data Science for Good Projects☆49Updated 6 years ago
- A repository filled with various data science projects.☆32Updated 4 years ago
- Slides and code examples for H2O tutorials at various events☆56Updated 7 years ago
- JupyterCon Missing Data Talk 2018☆23Updated 6 years ago
- Material for UW Extension Data Science 350☆19Updated 7 years ago
- Slides and materials for most of my talks by year☆92Updated last year
- ☆40Updated 7 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆26Updated 5 years ago
- Python data science and machine learning from Ted Petrou with Dunder Data☆54Updated 2 years ago
- Tutorial covering a new workflow available going from pandas to scikit-learn☆40Updated 2 years ago
- Experimental library for sampling and validating scikit-learn parameters☆10Updated 5 years ago
- Crestle version of fast.ai courses☆14Updated 7 years ago
- The goal of this repository is to detect the outliers for a dataset & see the impact of these outliers on predictive models☆23Updated 6 years ago
- Examples of how Python can speed up tasks that are cumbersome in Excel☆13Updated 8 years ago
- Advanced Text Analytics for Business☆15Updated 7 years ago
- Work for Mastering Large Datasets with Python☆18Updated 2 years ago
- Predict whether a student will correctly answer a problem based on past performance using automated feature engineering☆32Updated 4 years ago
- Distributed, large-scale, benchmarking framework for rigorous assessment of automatic machine learning repositories, projects, and librar…☆30Updated 2 years ago
- Teaching materials for the text analytics course☆19Updated 6 years ago