codspire / chicago-taxi-trips-analysis
Analysis of City Of Chicago Taxi Trip Dataset Using AWS EMR, Spark, PySpark, Zeppelin and Airbnb's Superset
☆15Updated 7 years ago
Alternatives and similar repositories for chicago-taxi-trips-analysis:
Users that are interested in chicago-taxi-trips-analysis are comparing it to the libraries listed below
- Examples of how Python can speed up tasks that are cumbersome in Excel☆13Updated 8 years ago
- Analytics on Apache Projects for Diversity☆18Updated 5 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆36Updated 6 months ago
- Python bindings for Matroid API☆16Updated 3 weeks ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- H2OAI Driverless AI Code Samples and Tutorials☆37Updated 3 months ago
- How to do data science with Optimus, Spark and Python.☆19Updated 5 years ago
- Analyzing Clickstream Data using Markov Chains and data mining SPACE algorithm☆29Updated 6 years ago
- Predict whether a student will correctly answer a problem based on past performance using automated feature engineering☆32Updated 4 years ago
- IBM Data Science Experience Desktop was built for those who want to download and play locally. Analyze, learn, and build with the tools y…☆33Updated 5 years ago
- A simple Spark TDD example☆26Updated 7 years ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach…☆19Updated 8 years ago
- Spark Projects for the Berkeley Data Science Course☆11Updated 9 years ago
- ☆13Updated 5 years ago
- ☆19Updated 3 years ago
- Apache Spark Application Development -- George Jen, Jen Tek LLC☆15Updated last year
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- Big Data Demystified meetup and blog examples☆31Updated 5 months ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆37Updated 5 years ago
- Know your ML Score based on Sculley's paper☆34Updated 5 years ago
- Extract, PreProcess, and Analyze big data on GPUs☆21Updated 6 years ago
- Project template for highly effective data science workflows☆29Updated 9 months ago
- A Scalable Data Cleaning Library for PySpark.☆26Updated 5 years ago
- PySpark, Databrick, h2o, MLlib☆18Updated 8 years ago
- ☆26Updated 5 years ago
- Jupyter notebooks for learning Python and Data Science, companion to Data Science Solutions book.☆36Updated 4 years ago
- ☆21Updated last year
- Work for Mastering Large Datasets with Python☆18Updated 2 years ago
- Bootcamp 7 Student Project Presentation☆22Updated 7 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆26Updated 2 years ago