codspire / chicago-taxi-trips-analysis
Analysis of City Of Chicago Taxi Trip Dataset Using AWS EMR, Spark, PySpark, Zeppelin and Airbnb's Superset
☆15 Updated 7 years ago
Alternatives and similar repositories for chicago-taxi-trips-analysis:
Users interested in chicago-taxi-trips-analysis are comparing it to the repositories listed below.
- ☆19 Updated 4 years ago
- Big Data Demystified meetup and blog examples ☆31 Updated 7 months ago
- H2OAI Driverless AI Code Samples and Tutorials ☆37 Updated 5 months ago
- ☆11 Updated 6 years ago
- ☆16 Updated last year
- Example of orchestrating dependent Databricks jobs using Airflow ☆11 Updated 5 years ago
- Blog post on ETL pipelines with Airflow ☆23 Updated 4 years ago
- Examples of how Python can speed up tasks that are cumbersome in Excel ☆13 Updated 8 years ago
- Repository of Notebooks taken from https://neo4j.com/graph-algorithms-book/ ☆26 Updated 5 years ago
- Python bindings for Matroid API ☆16 Updated 2 months ago
- Performance Benchmarks ☆21 Updated 5 months ago
- Data Scientist code test ☆19 Updated 4 years ago
- Predict whether a student will correctly answer a problem based on past performance using automated feature engineering ☆32 Updated 4 years ago
- Operations Research Algorithms ☆17 Updated last year
- Source code for the MC technical blog post "Data Observability in Practice Using SQL" ☆36 Updated 8 months ago
- ☆27 Updated 2 years ago
- Apache Spark Application Development -- George Jen, Jen Tek LLC ☆16 Updated last year
- Mastering Spark for Data Science, published by Packt ☆47 Updated 2 years ago
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph ☆21 Updated 3 years ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach… ☆19 Updated 8 years ago
- Getting Great Expectations setup to run on DataBricks with Spark Dataframes. ☆13 Updated 2 years ago
- Analyzing Clickstream Data using Markov Chains and data mining SPACE algorithm ☆29 Updated 6 years ago
- Posts, presentations and papers I've written. ☆39 Updated 4 years ago
- ☆12 Updated 8 years ago
- notebooks for nlp-on-spark ☆13 Updated 8 years ago
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017 ☆12 Updated 8 years ago
- Materials for Apache Arrow workshop at VLDB 2019 ☆42 Updated 4 years ago
- Interactive notebooks containing demonstration code of the splink library ☆38 Updated last year
- Spark NLP for Streamlit ☆15 Updated 3 years ago
- Business Data Analysis by HiPIC of CalStateLA ☆20 Updated 6 years ago