codspire / chicago-taxi-trips-analysis
Analysis of City Of Chicago Taxi Trip Dataset Using AWS EMR, Spark, PySpark, Zeppelin and Airbnb's Superset
☆15Updated 7 years ago
Alternatives and similar repositories for chicago-taxi-trips-analysis:
Users that are interested in chicago-taxi-trips-analysis are comparing it to the libraries listed below
- Examples of how Python can speed up tasks that are cumbersome in Excel☆13Updated 8 years ago
- ☆19Updated 3 years ago
- Big Data Demystified meetup and blog examples☆31Updated 6 months ago
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017☆12Updated 7 years ago
- Analytics on Apache Projects for Diversity☆18Updated 5 years ago
- Sample Notebooks for PipelineAI☆44Updated 2 years ago
- ☆26Updated last year
- Spark NLP for Streamlit☆15Updated 3 years ago
- Slide and notebook used for my talk on vaex at the Pandas summit 2019 @ Lodnon☆11Updated 5 years ago
- H2OAI Driverless AI Code Samples and Tutorials☆37Updated 3 months ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- Analyzing Clickstream Data using Markov Chains and data mining SPACE algorithm☆29Updated 6 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆36Updated 5 years ago
- Demo notebooks inside a docker for end-to-end examples☆113Updated 6 years ago
- Brian Farris' Talk on Reinforcement Learning and Multi-Armed Bandits for the Data Incubator☆30Updated 6 years ago
- Simple demonstration of how to build a complex real time machine learning visualization tool.☆16Updated 8 years ago
- Apache Spark Application Development -- George Jen, Jen Tek LLC☆15Updated last year
- Labs and data files for a full-day Spark workshop☆24Updated last year
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- Simple validator for submissions to DrivenData competitions☆19Updated 5 years ago
- Predict whether a student will correctly answer a problem based on past performance using automated feature engineering☆32Updated 4 years ago
- Python bindings for Matroid API☆16Updated last month
- Use Kafka and Apache Spark streaming to perform click stream analytics☆76Updated 4 years ago
- A simple Spark TDD example☆26Updated 7 years ago
- Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p…☆38Updated 3 years ago
- Data Scientist code test☆19Updated 4 years ago
- Jupyter notebooks for learning Python and Data Science, companion to Data Science Solutions book.☆36Updated 4 years ago
- Sample techniques for a variety of feature extraction methods☆32Updated 3 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆66Updated 9 years ago
- A repository filled with various data science projects.☆32Updated 4 years ago