codspire / chicago-taxi-trips-analysis
Analysis of City Of Chicago Taxi Trip Dataset Using AWS EMR, Spark, PySpark, Zeppelin and Airbnb's Superset
☆15Updated 7 years ago
Alternatives and similar repositories for chicago-taxi-trips-analysis:
Users that are interested in chicago-taxi-trips-analysis are comparing it to the libraries listed below
- Big Data Demystified meetup and blog examples☆31Updated 6 months ago
- Examples of how Python can speed up tasks that are cumbersome in Excel☆13Updated 8 years ago
- Analytics on Apache Projects for Diversity☆18Updated 5 years ago
- Mastering Spark for Data Science, published by Packt☆47Updated 2 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆36Updated 6 months ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- Bosch Kaggle competion: Reduce manufacturing failures (https://www.kaggle.com/c/bosch-production-line-performance)☆24Updated 8 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆36Updated 5 years ago
- Sample Notebooks for PipelineAI☆44Updated 2 years ago
- Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p…☆38Updated 3 years ago
- ☆16Updated 6 years ago
- Python bindings for Matroid API☆16Updated last month
- ☆26Updated last year
- Productivity Utilities for Data Science with Python Notebooks☆6Updated 5 years ago
- Webscikit is a set of tools to run a webserver as a JSON Webservice for scikit-learn predictions. It comes with two examples: boston and …☆9Updated 7 years ago
- Materials for Apache Arrow workshop at VLDB 2019☆42Updated 4 years ago
- Apache Spark Application Development -- George Jen, Jen Tek LLC☆15Updated last year
- Documentation and resources for deploying JupyterHub on Hadoop☆18Updated 5 years ago
- Public course material☆35Updated 5 years ago
- Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p…☆34Updated 4 years ago
- A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML☆15Updated 8 years ago
- Analyzing Clickstream Data using Markov Chains and data mining SPACE algorithm☆29Updated 6 years ago
- Predict whether a student will correctly answer a problem based on past performance using automated feature engineering☆32Updated 4 years ago
- Spark NLP for Streamlit☆15Updated 3 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- Fork of the Freely Extensible Biomedical Record Linkage program☆24Updated 8 years ago
- Simple validator for submissions to DrivenData competitions☆19Updated 5 years ago
- ☆19Updated 3 years ago
- Set of iPython and Jupyter extensions to improve user experience☆50Updated 5 years ago
- Machine Learning in Snowflake☆24Updated 5 years ago