codspire / chicago-taxi-trips-analysis
Analysis of the City of Chicago Taxi Trip Dataset using AWS EMR, Spark, PySpark, Zeppelin, and Airbnb's Superset
☆15 Updated 7 years ago
Alternatives and similar repositories for chicago-taxi-trips-analysis:
Users who are interested in chicago-taxi-trips-analysis are comparing it to the repositories listed below
- ☆19 Updated 4 years ago
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017 ☆12 Updated 8 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data ☆20 Updated 7 years ago
- ☆16 Updated 7 years ago
- Big Data Demystified meetup and blog examples ☆31 Updated 8 months ago
- A simple Spark TDD example ☆26 Updated 7 years ago
- This repo demonstrates how to load a sample Parquet-formatted file from an AWS S3 bucket. A Python job will then be submitted to an Apach… ☆19 Updated 8 years ago
- ☆16 Updated last year
- Spark NLP for Streamlit ☆15 Updated 3 years ago
- Model management example using Polyaxon, Argo and Seldon ☆23 Updated 6 years ago
- ☆13 Updated 7 years ago
- Data Scientist code test ☆19 Updated 4 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL" ☆38 Updated 9 months ago
- Distributed, large-scale, benchmarking framework for rigorous assessment of automatic machine learning repositories, projects, and librar… ☆30 Updated 2 years ago
- How to do data science with Optimus, Spark and Python. ☆19 Updated 5 years ago
- ☆11 Updated 6 years ago
- Predict the poverty of households in Costa Rica using automated feature engineering. ☆23 Updated 4 years ago
- Code to 1) scrape Wikipedia page view counts and 2) conduct time series analysis with GAM ☆47 Updated 7 years ago
- Sample Notebooks for PipelineAI ☆44 Updated 2 years ago
- Partly lecture and partly a hands-on tutorial and workshop, this is a three-part series on how to get started with MLflow. In this four p… ☆39 Updated 4 years ago
- H2OAI Driverless AI Code Samples and Tutorials ☆37 Updated 6 months ago
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph ☆21 Updated 3 years ago
- Building an API with the FastAPI framework to serve a scikit-learn model. ☆18 Updated 6 years ago
- Python bindings for Matroid API ☆16 Updated 3 months ago
- Simple validator for submissions to DrivenData competitions ☆19 Updated 5 years ago
- Playing with Instacart data in Neo4j ☆16 Updated 7 years ago
- Get Twitter trends with twitter4j, stream it to a Kafka topic, save it to MongoDB and visualize in Google Maps ☆14 Updated 3 years ago
- Project template for highly effective data science workflows ☆29 Updated last year
- Analytics on Apache Projects for Diversity ☆18 Updated 5 years ago
- A Scalable Data Cleaning Library for PySpark. ☆27 Updated 6 years ago