codspire / chicago-taxi-trips-analysis
Analysis of City Of Chicago Taxi Trip Dataset Using AWS EMR, Spark, PySpark, Zeppelin and Airbnb's Superset
☆15 Updated 7 years ago
Alternatives and similar repositories for chicago-taxi-trips-analysis
Users that are interested in chicago-taxi-trips-analysis are comparing it to the libraries listed below
- ☆19 Updated 4 years ago
- Big Data Demystified meetup and blog examples ☆31 Updated 9 months ago
- Examples of how Python can speed up tasks that are cumbersome in Excel ☆13 Updated 8 years ago
- Machine Learning in Snowflake ☆24 Updated 5 years ago
- Notebooks for the ML Link Prediction Course ☆14 Updated 4 years ago
- Predict whether a student will correctly answer a problem based on past performance using automated feature engineering ☆32 Updated 4 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle ☆33 Updated 8 years ago
- Build your feature store with macros right within your dbt repository ☆38 Updated 2 years ago
- Blog post on ETL pipelines with Airflow ☆23 Updated 4 years ago
- Predict the poverty of households in Costa Rica using automated feature engineering. ☆23 Updated 4 years ago
- Sample Notebooks for PipelineAI ☆44 Updated 2 years ago
- How to do data science with Optimus, Spark and Python. ☆19 Updated 5 years ago
- Python library for deploying models built using Python to Alteryx Promote. ☆15 Updated 3 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model ☆36 Updated 6 years ago
- Repository of Notebooks taken from https://neo4j.com/graph-algorithms-book/ ☆26 Updated 5 years ago
- A simple Spark TDD example ☆26 Updated 7 years ago
- Simple samples for writing ETL transform scripts in Python ☆22 Updated 3 years ago
- Building an API with the FastAPI framework to serve a scikit-learn model. ☆18 Updated 6 years ago
- Labs and data files for a full-day Spark workshop ☆24 Updated last week
- Simple demonstration of how to build a complex real time machine learning visualization tool. ☆16 Updated 9 years ago
- Code to 1) scrape Wikipedia page view counts, and to 2) conduct time series analysis with GAM ☆47 Updated 7 years ago
- Docker compose and Google Colab demo to build a CDC with Delta Lake ☆15 Updated 2 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation. ☆37 Updated 5 years ago
- H2OAI Driverless AI Code Samples and Tutorials ☆37 Updated 7 months ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data ☆20 Updated 7 years ago
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017 ☆12 Updated 8 years ago
- Understanding Seattle Bike Count data ☆18 Updated 8 years ago
- Extract, PreProcess, and Analyze big data on GPUs ☆21 Updated 6 years ago
- ☆24 Updated 6 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL" ☆38 Updated 10 months ago