WillKoehrsen / mastering-large-datasets
Work for Mastering Large Datasets with Python
☆19 · Updated 2 years ago
Alternatives and similar repositories for mastering-large-datasets:
Users who are interested in mastering-large-datasets are comparing it to the repositories listed below
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model ☆36 · Updated 6 years ago
- Data Science for Good Projects ☆49 · Updated 6 years ago
- Code repo for Packt course I developed, "Beginning Data Wrangling with Python" ☆30 · Updated 4 years ago
- ☆12 · Updated 6 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟 ☆53 · Updated 3 years ago
- ☆26 · Updated 5 years ago
- Python data science and machine learning from Ted Petrou with Dunder Data ☆55 · Updated 2 years ago
- Few tutorials on pandas, matplotlib and seaborn ☆26 · Updated 8 years ago
- Small example on how you can detect multicollinearity ☆13 · Updated 3 years ago
- Classic Computer Science Problems with Python ☆28 · Updated 5 years ago
- "Building a Recommender System from Scratch" Workshop Material for PyDataDC 2018 ☆24 · Updated 6 years ago
- The repository for the course in Udemy ☆16 · Updated 5 years ago
- Exploratory Data Analysis with Pandas and Python 3.x, published by Packt ☆44 · Updated 2 years ago
- Work related to the Airbus Ship Detection Challenge https://www.kaggle.com/c/airbus-ship-detection ☆13 · Updated 6 years ago
- My presentation at ODSC India 2018 about Deep Learning with Apache Spark ☆27 · Updated 6 years ago
- ☆7 · Updated 6 years ago
- Detailed notes and code to learn the basics of machine learning with scikit-learn. ☆35 · Updated 8 years ago
- Guide for applying Unit Testing in data-driven projects ☆19 · Updated 4 years ago
- ☆19 · Updated 4 years ago
- Recurrent Neural Networks for Timeseries ☆24 · Updated 5 years ago
- OptimalFlow is an omni-ensemble and scalable automated machine learning Python toolkit, which uses Pipeline Cluster Traversal Experiments… ☆27 · Updated last year
- Writing Primer for Data Scientists ☆18 · Updated 5 years ago
- Tips for Advanced Feature Engineering ☆52 · Updated 4 years ago
- Create Interactive Dashboards with Streamlit and Python Coursera ☆10 · Updated 4 years ago
- Applying automated feature engineering to the Kaggle Home Credit Default Risk Competition ☆19 · Updated 6 years ago
- Tutorial covering a new workflow available going from pandas to scikit-learn ☆40 · Updated 2 years ago
- Contains code and presentation for my interactive hack session, 'Effective Feature Engineering: A Structured Approach to Building Better … ☆30 · Updated 4 years ago
- Building simple ML apps with Streamlit ☆24 · Updated 4 years ago
- How to do data science with Optimus, Spark and Python. ☆19 · Updated 5 years ago
- Detailed notes and codes on learning pandas quickly for machine learning. ☆26 · Updated 8 years ago