codspire / chicago-taxi-trips-analysisLinks
Analysis of City Of Chicago Taxi Trip Dataset Using AWS EMR, Spark, PySpark, Zeppelin and Airbnb's Superset
☆15Updated 8 years ago
Alternatives and similar repositories for chicago-taxi-trips-analysis
Users that are interested in chicago-taxi-trips-analysis are comparing it to the libraries listed below
Sorting:
- ☆19Updated 4 years ago
- Big Data Demystified meetup and blog examples☆31Updated last year
- Public course material☆35Updated 6 years ago
- Code to 1) scrap wikipedia page view counts, and to 2) conduct time series analysis with GAM☆47Updated 8 years ago
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017☆12Updated 8 years ago
- H2OAI Driverless AI Code Samples and Tutorials☆38Updated last year
- Predict whether a student will correctly answer a problem based on past performance using automated feature engineering☆32Updated 5 years ago
- Slides and materials for most of my talks by year☆92Updated 2 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆38Updated 6 years ago
- Apache Spark Application Development -- George Jen, Jen Tek LLC☆16Updated 2 years ago
- Blog post on ETL pipelines with Airflow☆24Updated 4 months ago
- Analytics on Apache Projects for Diversity☆18Updated 6 years ago
- PySpark Machine Learning Examples☆45Updated 7 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆40Updated last year
- Demo notebooks inside a docker for end-to-end examples☆112Updated 7 years ago
- Brian Farris' Talk on Reinforcement Learning and Multi-Armed Bandits for the Data Incubator☆30Updated 7 years ago
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆102Updated 6 years ago
- Notes for Data Science 350 Class☆24Updated 8 years ago
- ☆59Updated 4 years ago
- Project template for highly effective data science workflows☆29Updated last month
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆36Updated 6 years ago
- ☆39Updated 8 years ago
- A series of Jupyter notebooks that walk you through Machine Learning with Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlo…☆87Updated 2 years ago
- Predict taxi trip duration based on historical trips using automated feature engineering☆62Updated 5 years ago
- Data Scientist code test☆19Updated 5 years ago
- MLflow App Library☆77Updated 7 years ago
- Simple demonstration of how to build a complex real time machine learning visualization tool.☆16Updated 9 years ago
- ☆16Updated 2 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle☆33Updated 9 years ago
- How to do data science with Optimus, Spark and Python.☆19Updated 6 years ago