codspire / chicago-taxi-trips-analysis
Analysis of City of Chicago Taxi Trip Dataset Using AWS EMR, Spark, PySpark, Zeppelin and Airbnb's Superset
☆15 Updated 7 years ago
Related projects
Alternatives and complementary repositories for chicago-taxi-trips-analysis
- Big Data Demystified meetup and blog examples ☆31 Updated 2 months ago
- ☆19 Updated 3 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation. ☆37 Updated 5 years ago
- Examples of how Python can speed up tasks that are cumbersome in Excel ☆13 Updated 8 years ago
- How to do data science with Optimus, Spark and Python. ☆18 Updated 5 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL" ☆36 Updated 3 months ago
- A simple Spark TDD example ☆26 Updated 7 years ago
- Sample Notebooks for PipelineAI ☆44 Updated 2 years ago
- ☆11 Updated 6 years ago
- Blog post on ETL pipelines with Airflow ☆23 Updated 4 years ago
- Machine Learning in Snowflake ☆24 Updated 5 years ago
- ☆26 Updated 10 months ago
- Project template for highly effective data science workflows ☆29 Updated 7 months ago
- Mastering Spark for Data Science, published by Packt ☆46 Updated last year
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines. ☆46 Updated last year
- Deep Learning with Apache Spark and Deep Cognition ☆58 Updated 6 years ago
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017 ☆12 Updated 7 years ago
- Python bindings for Matroid API ☆16 Updated last month
- Predict whether a student will correctly answer a problem based on past performance using automated feature engineering ☆32 Updated 4 years ago
- Spark NLP for Streamlit ☆15 Updated 3 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u… ☆26 Updated 5 years ago
- Business Data Analysis by HiPIC of CalStateLA ☆20 Updated 6 years ago
- Workshop for Spark and Databricks ☆54 Updated 4 years ago
- ☆18 Updated 3 months ago
- Simple demonstration of how to build a complex real time machine learning visualization tool. ☆16 Updated 8 years ago
- ☆14 Updated 6 years ago
- Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMaker ☆31 Updated 3 years ago
- Extract, PreProcess, and Analyze big data on GPUs ☆21 Updated 6 years ago
- Apache Spark Application Development -- George Jen, Jen Tek LLC ☆15 Updated last year
- A Scalable Data Cleaning Library for PySpark. ☆26 Updated 5 years ago